Indraneel Dutta BaruahinTowards DevExploring Big Data with Apache Spark: Spark Performance Optimisation and Common ErrorsLet’s optimize spark performance using parquets, avoiding UDFs, caching techniques, shorter lineages, etc.Apr 1Apr 1
Indraneel Dutta BaruahinDev GeniusExploring Big Data with Apache Spark: Architecture, Memory Management, and ConfigurationHow is Spark architecture set up? What are the different memory regions in Spark? How to configure Spark correctly?Mar 28Mar 28
Indraneel Dutta BaruahExploring Big Data with Apache Spark: Introduction and Key ComponentLet’s deep dive into what makes Apache Spark great — DAGs, RDDs, Dataframes, DatasetsMar 28Mar 28
Indraneel Dutta BaruahinTowards AIHow Do Inherently Interpretable AI Models Work? — GAMINETDeep dive inside GAMINET — sparsity and hereditary constraints, marginal clarity, main effects, interaction effects, etc.Dec 20, 20231Dec 20, 20231
Indraneel Dutta BaruahinNerd For TechHow Do Inherently Interpretable AI Models Work? — Explainable Boosting MachineLifting the curtain behind Explainable Boosting Machine (EBMs) — how does it calculate main and interaction effects?Dec 16, 2023Dec 16, 2023
Indraneel Dutta BaruahinNerd For TechHow Do Inherently Interpretable AI Models Work? — General Additive ModelsUnderstanding the math behind GAMs including link function and functional formDec 16, 2023Dec 16, 2023
Indraneel Dutta BaruahinNerd For TechExploring Big Data with Apache Spark: Introduction and Key ComponentsA comprehensive guide on how Apache Spark works and how to use it efficiently!Dec 16, 2023Dec 16, 2023
Indraneel Dutta BaruahinNerd For TechDimensionality Reduction Techniques — PCA, LCA and SVDLet’s learn about PCA, LCA, and SVD. Their pros, cons, and when to use along with their Python implementation.Oct 7, 20233Oct 7, 20233
Indraneel Dutta BaruahinANOLYTICSAll you need to know about encoding techniques!How to use label encoding, one hot encoding, catboost encoding, etc. along with its Python implementation!Sep 30, 20231Sep 30, 20231
Indraneel Dutta BaruahinTowards DevHow to handle imbalanced data — Undersampling and Hybrid techniquesLet’s learn about ClusterCentroids, Tomek links, NearMiss, RepeatedEditedNearestNeighbours, etc., and their Python implementation!Sep 24, 2023Sep 24, 2023