Stack

At Datafitters, we specialize in the Microsoft stack for data products, with technical knowledge and years of experience in:

BUSINESS INTELLIGENCE

Data Warehouse development (Kimball)

Using the Kimball methodology (dimensional modeling) to build data warehouses that are optimized for reporting and analysis.
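As a rough illustration of the dimensional modeling behind the Kimball approach, the sketch below builds a tiny star schema (one fact table surrounded by dimension tables) in an in-memory SQLite database. The table names, columns, and sample rows are hypothetical and chosen only for this example; a real warehouse would typically live in SQL Server or Azure SQL Database.

```python
import sqlite3


def build_star_schema():
    """Create a minimal star schema with hypothetical sample data."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        -- Dimension tables hold descriptive attributes.
        CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, name TEXT, category TEXT);
        CREATE TABLE dim_date (date_key INTEGER PRIMARY KEY, year INTEGER, month INTEGER);
        -- The fact table holds measures plus keys pointing at the dimensions.
        CREATE TABLE fact_sales (
            product_key INTEGER REFERENCES dim_product(product_key),
            date_key INTEGER REFERENCES dim_date(date_key),
            quantity INTEGER,
            revenue REAL
        );
        INSERT INTO dim_product VALUES (1, 'Widget', 'Hardware'), (2, 'Manual', 'Books');
        INSERT INTO dim_date VALUES (20240101, 2024, 1), (20240201, 2024, 2);
        INSERT INTO fact_sales VALUES
            (1, 20240101, 2, 20.0),
            (1, 20240201, 1, 10.0),
            (2, 20240101, 3, 15.0);
    """)
    return conn


def revenue_by_category(conn):
    """Typical reporting query: join the fact table to a dimension and aggregate."""
    return conn.execute("""
        SELECT p.category, SUM(f.revenue)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_key = f.product_key
        GROUP BY p.category
        ORDER BY p.category
    """).fetchall()
```

Calling revenue_by_category(build_star_schema()) returns one total per product category, which is the kind of query a star schema is designed to keep simple.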

ETL

Short for Extract, Transform, Load: a process that extracts data from various sources, transforms it into a structured format, and loads it into a target database.
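The three steps can be sketched in a few lines of Python. This is a minimal example under stated assumptions: the CSV content, the orders table, and the function names are hypothetical, and an in-memory SQLite database stands in for the target database.

```python
import csv
import io
import sqlite3

# Hypothetical source data; in practice this would come from files, APIs, or databases.
RAW_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,20
3,carol,not_a_number
"""


def extract(text):
    """Extract: read raw rows from a CSV source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: normalize names, cast types, and skip rows whose amount cannot be parsed."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # a real pipeline would usually log or quarantine bad rows
        clean.append((int(row["order_id"]), row["customer"].strip().lower(), amount))
    return clean


def load(rows, conn):
    """Load: write the structured rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl(conn):
    """Run the full pipeline and return what ended up in the target table."""
    load(transform(extract(RAW_CSV)), conn)
    return conn.execute(
        "SELECT order_id, customer, amount FROM orders ORDER BY order_id"
    ).fetchall()
```

Keeping each step in its own function makes it easy to test the transformation logic separately from the source and target systems.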

Medallion Architecture

A layered architectural approach for data management in a Data Lake, consisting of bronze, silver, and gold layers, representing raw, cleaned, and business-ready data, respectively.
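The flow between the three layers can be illustrated with plain Python. This is only a sketch: the sensor records and the function names to_silver and to_gold are hypothetical, and in practice each layer would usually be a set of tables in a Data Lake rather than in-memory lists. Treating identical records as duplicates is a simplification made for this example.

```python
from collections import defaultdict

# Bronze: raw data exactly as it arrived, including duplicates and gaps.
bronze = [
    {"sensor": "A", "reading": "21.5"},
    {"sensor": "A", "reading": "21.5"},  # duplicate record
    {"sensor": "B", "reading": ""},      # missing value
    {"sensor": "B", "reading": "19.0"},
    {"sensor": "A", "reading": "22.5"},
]


def to_silver(rows):
    """Silver: cleaned data (duplicates and empty readings removed, types cast)."""
    seen = set()
    silver = []
    for row in rows:
        key = (row["sensor"], row["reading"])
        if not row["reading"] or key in seen:
            continue
        seen.add(key)
        silver.append({"sensor": row["sensor"], "reading": float(row["reading"])})
    return silver


def to_gold(rows):
    """Gold: business-ready aggregate (average reading per sensor)."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["sensor"]].append(row["reading"])
    return {sensor: sum(values) / len(values) for sensor, values in grouped.items()}
```

Here to_gold(to_silver(bronze)) yields one average per sensor, while the untouched bronze list remains available for reprocessing if the cleaning rules change.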


AZURE CLOUD

Data Factory

A cloud-based data integration service that allows you to create, schedule, and orchestrate your ETL/ELT workflows.

SQL Database

Azure SQL Database, a fully managed database service with built-in intelligence to automate tasks like performance tuning and threat detection.

Analysis Services

Azure Analysis Services provides enterprise-grade data modeling in the cloud.

Data Lake Storage

Highly scalable and secure data storage for big data analytics workloads.

Logic Apps

A cloud service that helps you schedule, automate, and orchestrate tasks, business processes, and workflows when you need to integrate apps, data, systems, and services across enterprises or organizations.

Functions

An event-driven, compute-on-demand experience that extends the existing Azure application platform with the ability to run code triggered by events occurring in Azure or third-party services.

Delta Lake

An open-source storage layer that brings reliability to Data Lakes, ensuring ACID transactions and efficient upserts and deletions.
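To make "upserts and deletions" concrete, the sketch below shows MERGE-style semantics in plain Python: rows that match on a key are updated, rows that do not match are inserted, and listed keys are deleted. This is not the Delta Lake API and offers none of its transactional guarantees; the function names and sample rows are hypothetical and only illustrate the behavior.

```python
def merge_upsert(target, updates, key):
    """Return a new table: matching rows are updated, new rows are inserted."""
    merged = {row[key]: dict(row) for row in target}
    for row in updates:
        # Update the existing row if the key matches, otherwise insert a new one.
        merged[row[key]] = {**merged.get(row[key], {}), **row}
    return sorted(merged.values(), key=lambda r: r[key])


def delete_rows(target, keys_to_delete, key):
    """Return a new table without the rows whose key is in keys_to_delete."""
    return [dict(row) for row in target if row[key] not in keys_to_delete]


customers = [
    {"id": 1, "name": "Alice", "city": "Ghent"},
    {"id": 2, "name": "Bob", "city": "Antwerp"},
]
changes = [
    {"id": 2, "city": "Brussels"},                  # existing key: update
    {"id": 3, "name": "Carol", "city": "Leuven"},   # new key: insert
]

updated = merge_upsert(customers, changes, key="id")
remaining = delete_rows(updated, {1}, key="id")
```

Both functions return new lists instead of modifying their input, loosely mirroring how each committed change produces a new version of a table while earlier versions stay readable.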

PySpark

The Python API for Apache Spark, used for big data processing and analysis, which allows for easy integration with Python libraries and data processing workflows.


BIG DATA

Spark

Apache Spark is an open-source distributed general-purpose cluster-computing framework, mainly used for big data processing and analytics.

Databricks

A platform service that provides cloud-based big data processing using Apache Spark.

Lakehouse

A data management architecture that combines elements of data lakes and data warehouses, aiming to provide the scalability and flexibility of the former with the data management features of the latter.


PROGRAMMING

Python

A versatile programming language that is widely used for web development, scripting, scientific computing, and artificial intelligence.


ANALYSIS AND REPORTING

Power BI

A suite of business analytics tools that deliver insights throughout your organization.

DAX

Data Analysis Expressions, a library of functions and operators used to build formulas and expressions in Power BI, Analysis Services, and Power Pivot.

T-SQL

Transact-SQL is Microsoft's and Sybase's proprietary extension to SQL used to interact with relational databases.


ON PREMISES

SQL Server

A relational database management system that is used to store and retrieve data as requested by other software applications, with tools for optimizing performance.

SSIS

SQL Server Integration Services, a platform for building enterprise-level data integration and data transformation solutions.

SSAS (cubes)

SQL Server Analysis Services, used to analyze data through multidimensional cubes that allow for complex queries and data analysis.

SSRS

SQL Server Reporting Services, a system for designing, deploying, and managing reports.