Data Archive
Tag: data_management
9 items with this tag.
06 Jan 2025
What is Master Data Management (MDM)?
data_storage
data_governance
data_management
06 Jan 2025
The Data Hierarchy of Needs
data_management
06 Jan 2025
What is Data Lineage?
data_management
06 Jan 2025
What is a Data Product?
data_management
business
06 Jan 2025
Distributed Computing
data_management
data_processing
06 Jan 2025
Full Lifecycle Management
data_management
06 Jan 2025
Home
statistics
regressor
data_exploration
classifier
evaluation
clustering
deep_learning
anomaly_detection
model_architecture
model_algorithm
ml_process
ml_optimisation
model_explainability
GenAI
language_models
NLP
database
data_storage
data_modeling
data_cleaning
data_transformation
data_processing
software
data_orchestration
business
code_snippet
software_architecture
cmd
career
energy
data_visualization
math
drafting
question
data_quality
data_governance
data_management
field
communication
06 Jan 2025
Data Management
data_management
06 Jan 2025
What is Data Observability?
data_orchestration
data_management