Data Archive
Search
Search
Dark mode
Light mode
Explorer
Inbox
Monthly_Review
Archive
October 2024
November 2024
Inbox Tasks
pages
Data Archive
Overview
Questions
standardised
AB testing
Accessing Gen AI generated content
Accuracy
ACID Transaction
Activation Function
Active Learning
Ada boosting
Adjusted R squared
Agent-based modelling
Apache Kafka
API
API Driven Microservices
Attention Is All You Need
Attention mechanism
AWS Lambda
Azure
Backpropagation in Neural Networks
Bag of words
Bagging
Batch Processing
BERT
BERT Pretraining of Deep Bidirectional Transformers for Language Understanding
Bias and variance
Big Data
BigQuery
Boosting
Boxplot
CatBoost
Central Limit Theorem
Chain of thought
Change Management
CI-CD
Classification
Classification Metrics
Classification Report
Clean before ER diagram
Clustering
Command Prompt
Communication principles
Communication Techniques
Confidence Interval
Confusion Matrix
Convolutional Neural Networks
Correlation
Correlation vs Causation
Cross Entropy
Cross validation
CTE (Common Table Expression)
Data AI Education at Work
Data Analysis
Data Cleansing
Data Collection
Data Engineer
Data Engineering
Data Management
Data Mining - CRISP
Data Modelling
Data Orchestration
Data Pipeline
Data Pipeline to Data Products
Data Preprocessing
Data Principles
Data Science
Data Selection
Data Streaming
Data Validation
Database
Databricks
DBScan
dbt
Decision Tree
Deep Learning Frameworks
Deep Learning Overview
Difference between Databricks vs. Snowflake
Difference between snowflake to hadoop
Digital Transformation
Digital twin
dimensional modeling
Dimensionality Reduction
Directed Acyclic Graph (DAG)
Directory Structure
Distributed Computing
Distributions
Dropout
EDA
Edge Machine Learning Models
Elastic Net
ELT
Embedded Methods
Encoding Categorical Variables
Energy
Energy ABM
Energy Storage
Estimator
ETL Pipeline example
ETL vs. ELT
etlt
Evaluating Language Models
Event Driven Microservices
Everything
Excel & Sheets
Explain the curse of dimensionality
Exploration vs. Exploitation
F1 Score
Fabric
Feature Engineering
Feature Extraction
Feature Importance
Feature Preprocessing
Feature Scaling
Feature Selection
Feature Selection vs Feature Importance
Flask
Folder Tree Diagram
Forward Propagation in Neural Networks
Full Lifecycle Management
Gini Impurity
Gini Impurity vs Cross Entropy
Google My Maps Data Extraction
Gradient Boosting
Gradient Descent
GridSeachCv
Groupby
Groupby vs Crosstab
Guardrails
Hadoop
Handling Missing Data
Heatmap
How businesses use Gen AI
Hyperparameter
Imbalanced Datasets
Industries of interest
inference
inference versus prediction
information theory
Interpretability
Isolation Forest and Its Use in Anomaly Detection
Java vs JavaScript
Json
Json to Yaml
K-means
Knowledge Graph
Knowledge graph vs RAG setup
Knowledge Work
Language Models Large (LLMs) vs Small (SLMs)
Lasso
learning rate
Learning Styles
lemmatization
LightGBM
LightGBM vs XGBoost vs CatBoost
Linear Regression
LLM
Local Interpretable Model-agnostic Explanations
Logistic Regression
loss function
LSTM
Machine Learning Algorithms
Machine Learning Operations
Machine Learning Workflow
Manifold learning
Markov Decision Processes
Microsoft Access
Model Ensembling
Model Evaluation
Model Evaluation vs Model Optimisation
Model Interpretability
Model Observability
Model Optimisation
Model Selection
Monolith Architecture
Multi-head attention
Multicollinearity
Natural Language Processing
Network Design
Neural network
Ngrams
Normalisation of Text
OLTP
Optimisation function
Optuna
Ordinary Least Squares
Outliers
Overfitting in Machine Learning
parametric vs non-parametric models
parsimonious
Part of speech tagging
PowerBI
Precision
Precision or Recall
Principal Component Analysis
Prompt Exploring Topics
Prompt Extracting information from blog posts
Prompting
Python
PyTorch
Q-Learning
Quartz
R squared
RAG
Recall
Recurrent Neural Networks
Regression Analysis and its Applications
Regression metrics
Regularisation of Tree based models
Regularization in Machine Learning
Reinforcement learning
Ridge
ROC and AUC
Sarsa
Scala
Scientific Method
semantic layer
Sensitivity
shapefile
SHapley Additive exPlanations
Sharepoint
Single source of truth
Small Language Models
Smart Grids
SMSS
Software Development Life Cycle
Specificity
Stack
Stemming
Stochastic Gradient Descent
Storage solutions
Strongly vs Weakly typed language
Summarisation
Supervised Learning
Support Vector Machines
Support Vector Regression
Symbolic computation
t-SNE
Technical Debt
Terminal commands
The Data Hierarchy of Needs
Tokenisation
Transfer Learning
Transformer
Transformers vs RNNs
TypeScript
unstructured data
Unsupervised learning
Use of RNNs in energy sector
Vector Embedding
Violin plot
WCSS and elbow method
Weak Learners
Web Map Tile Service (WMTS)
What are Data Processing Techniques (row-based, columnar, vectorized)?
What are Dimensions?
What are the top Cloud Providers?
What is a Data Lake?
What is a Data Lakehouse?
What is a Data Product?
What is a Data Warehouse?
What is a Jinja Template?
What is a Lambda Architecture?
What is a Metric?
What is a policy in RL
What is a Storage Layer / Object Store?
What is an In-Memory Format?
What is Apache Airflow?
What is Apache Spark?
What is Business Intelligence
What is Data Governance?
What is Data Integration?
What is Data Lineage?
What is Data Literacy?
What is Data Observability?
What is Data Quality?
What is data transformation?
What is declarative?
What is DevOps?
What is ETL?
What is Granularity
What is imperative?
What is Kubernetes?
What is Machine Learning?
What is MapReduce?
What is Master Data Management (MDM)?
What is Normalization?
What is OLAP (Online Analytical Processing)?
What is Reverse ETL?
What is Schema Evolution?
What is semi-structured data?
What is Slowly Changing Dimension?
What is SQL?
What is structured data?
What is the Big-O Notation?
What is YAML?
windows pip install
Wrapper Methods
XGBoost
Tasks
Data Archive
Questions
Overview
Tasks
Home
❯
tags
❯
Tag: data_pipeline
Tag: data_pipeline
2 items with this tag.
20 Nov 2024
Data Pipeline to Data Products
question
data_orchestration
anomaly_detection
data_pipeline
data_products
20 Nov 2024
Data Pipeline
data_pipeline