Template: Hub/Link
When writing about satellite loading, we also need to talk about hub and link loading, because they happen at the same time. Before Data Vault 2.0, a hub or link surrogate key was an integer. Nowadays, modern data vaulters use hash keys instead to allow parallel loading. Others are going back to the roots and […]
Template: Satellite
Finally, we reach our goal of talking about Data Vault objects. This is the new area I’m working in and learning about, and you might be interested in it as well. Data Vault consists mainly of 3 objects: Hub, Link and Satellite. I’m not going to discuss what Data Vault is. There are lots of resources in […]
Template: History (DataStore)
A history layer (or Persistent Staging Area, a term from Roelant Vos) is a safeguarding layer. All data arrives unmodified from the source. People in the Data Vault space would probably suggest creating a Source Vault if I don’t want to build a proper Raw Vault with good Business Keys. I considered that option too. I […]
Template: Stage (View)
What is the fastest way to ingest data? To persist or to stream? A view comes close to streaming data. Persisting is always kind of stop-and-go in my view. I wanted to wait until the end of my little template story, but I couldn’t resist talking about it now. I’m too excited about it! […]
Template: Stage
Usually, one starts with an easy template. The stage table, however, is the most difficult one. The first running version was done in a few days, but perfecting it and adding more features has taken us months! The stage table is used everywhere, so I’m starting with it. Where do we use a […]
WhereScape
WhereScape consists of 2 applications: WhereScape 3D and WhereScape RED. WhereScape 3D is the newer application and is the way to go for the future. With 3D I can dig into my source system, profile it, document it and create my future design, and at the end it spits out any objects (including any needed intermediate […]
Business Intelligence Automation
How to build a data warehouse? I used to work with SSIS on Microsoft SQL Server to generate all my pipelines. This was fast and convenient, and a lot of people in the industry work like that. Define a source, do some transformations and load the result into a target table. Very easy, very fast. […]
Data Loading
In my previous post I roughly documented a data warehouse and some data modeling styles. In this post I want to discuss how I decided on the way I want to ingest my data and serve it to my users. What we know for sure is that we have some sources and our […]
Universal Loading Pattern Reloaded
There was quite some noise when I published my blog post about the Universal Loading Pattern last year, a common view of how to process data to any target, be it a Data Vault Hub or a Fact table. Since then I have worked with this pattern happily without any incidents. Some time ago I thought, well, […]
By ecki • Business Intelligence