Category: Data Engineering

  • Building a Data Pipeline in Databricks

    The attached document contains the assignment description and questions. Please submit your answers by uploading the following:

    1. A Databricks notebook file (i.e., .ipynb) that contains all the code and its output.

    2. One PDF document as a short report.

  • Building a Data Pipeline in Databricks

    The attached document contains the assignment description and questions. Please submit your answers by uploading the following:

    1. A Databricks notebook file (i.e., .ipynb) that contains all the code and its output.

    2. One PDF document as a short report.

  • Explain the difference between ETL(Extract transform load)an…

    The Workflow Mechanics: Clearly defining the order of operations. ETL performs the transformation in a dedicated processing engine before moving data to the destination, while ELT loads raw data directly into the target system and relies on that system’s compute power to transform it.

    Infrastructure Context: Explaining that ETL is traditionally used with legacy, on-premises data warehouses that have limited storage and compute, whereas ELT has become the standard for modern cloud data warehouses (like BigQuery, Snowflake, or Redshift) that scale compute and storage separately.
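
    The difference in the order of operations can be sketched in a few lines of Python. This is only an illustration under stated assumptions: an in-memory SQLite database stands in for the target warehouse, and the sample rows, table names (orders_etl, orders_raw, orders_elt), and helper functions are hypothetical, not part of the original question.

```python
import sqlite3

# Hypothetical raw records; names and values are invented for illustration.
RAW_ORDERS = [
    ("1", " alice ", "10.50"),
    ("2", "BOB", "3.25"),
    ("3", "alice", "4.00"),
]


def clean(row):
    """Transform step: cast types and normalize the customer name."""
    order_id, customer, amount = row
    return int(order_id), customer.strip().lower(), float(amount)


def run_etl(conn):
    """ETL: transform in the pipeline process, then load only clean rows."""
    conn.execute("CREATE TABLE orders_etl (id INTEGER, customer TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO orders_etl VALUES (?, ?, ?)",
        [clean(r) for r in RAW_ORDERS],
    )


def run_elt(conn):
    """ELT: load raw rows as-is, then transform with the target's SQL engine."""
    conn.execute("CREATE TABLE orders_raw (id TEXT, customer TEXT, amount TEXT)")
    conn.executemany("INSERT INTO orders_raw VALUES (?, ?, ?)", RAW_ORDERS)
    conn.execute(
        """
        CREATE TABLE orders_elt AS
        SELECT CAST(id AS INTEGER)   AS id,
               LOWER(TRIM(customer)) AS customer,
               CAST(amount AS REAL)  AS amount
        FROM orders_raw
        """
    )


def totals(conn, table):
    """Sum order amounts per customer from the given table."""
    query = (
        f"SELECT customer, SUM(amount) FROM {table} "
        "GROUP BY customer ORDER BY customer"
    )
    return conn.execute(query).fetchall()


conn = sqlite3.connect(":memory:")
run_etl(conn)
run_elt(conn)
etl_totals = totals(conn, "orders_etl")
elt_totals = totals(conn, "orders_elt")
print(etl_totals == elt_totals)
```

    Both paths end with the same cleaned table. The practical difference is where the transformation runs and that the ELT path keeps the untouched raw table in the target system, so it can be re-transformed later without re-extracting from the source.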

  • We need to know the basics of data

    First, understand what data is; after that, there are other things we need to learn.

  • Data services

    Data services are technology-driven solutions, including software, cloud-based tools (DaaS), and managed services, that facilitate the access, integration, storage, management, and analysis of data. They act as middleware to modernize legacy systems, improve data quality, and enable real-time, on-demand data access for applications.

    Key components of data services include:

    Data as a Service (DaaS): A cloud-based model where a provider handles data storage, integration, and processing, delivering it on-demand to users.

    Data Management & Integration: Services that include ETL/ELT (extract, transform, load; extract, load, transform), data cleansing, migration, and data governance.

    Data Processing & Analytics: The use of AI and machine learning to analyze data for business insights.

    Business Data Services (BDS): High-capacity connections, such as Ethernet or specialized access services, for enterprise networking.

    Data services help organizations break down silos by providing a unified view of data, enhancing decision-making, and streamlining operational efficiency.

  • CSE6242 Homework: designing a good table. Visualizing data w…

    Design a data table, grouped bar chart, and stacked bar chart in Tableau using board game datasets to compare game categories, playtime groups, and mechanics, while applying filtering by maximum players and exporting the resulting visualizations as images.
    You will need to show (username placeholder) as the user name in the bottom-left corner of each image.

    4 PNG files in total

  • CSE6242 Homework: designing a good table. Visualizing data w…

    Design a data table, grouped bar chart, and stacked bar chart in Tableau using board game datasets to compare game categories, playtime groups, and mechanics, while applying filtering by maximum players and exporting the resulting visualizations as images.
    You will need to show ychai64 as the user name in the bottom-left corner of each image.

    4 PNG files in total