I used AI to build a SaaS, and here’s my experience

Tech

by Sunny Srinidhi - July 16, 2025

A seasoned developer reflects on his journey with coding, particularly frontend technologies. Despite initial fears surrounding frameworks like React, he successfully embraced Swift for personal projects, enhancing his coding skills. However, his experience using AI tools like Cursor for web apps revealed limitations and frustrations, leading him back to Swift for reliability and confidence.

The Impact of Generative AI on Data Engineering

Data Science

by Sunny Srinidhi - March 9, 2025

Generative AI is transforming the field of data engineering by automating complex processes such as data augmentation, cleaning, integration, and anomaly detection. Unlike traditional AI, which focuses on analysis and prediction, Generative AI creates new data based on learned patterns. This capability improves data quality, enhances efficiency, and enables scalable solutions. However, challenges like data privacy, model bias, and ethical concerns must be carefully managed. As AI technology advances, its role in data engineering will continue to expand, leading to more intelligent and automated data workflows.

The Road Ahead: Key Data Engineering Trends for 2025

by Sunny Srinidhi - December 31, 2024

As we step into 2025, the world of data engineering is poised for transformative growth. From the rise of unified data architectures to the integration of AI-driven tools, the landscape is evolving faster than ever. This blog explores the key trends shaping the future—real-time data processing, edge computing, enhanced data governance, and more—while providing actionable insights on how professionals and organizations can adapt. Whether you’re a seasoned data engineer or just starting your journey, this comprehensive guide will help you navigate the challenges and seize the opportunities of 2025 with confidence.

Data Automation with AI/ML: A Comprehensive Guide

by Sunny Srinidhi - November 28, 2024

The article discusses the transformative impact of artificial intelligence (AI) and machine learning (ML) on data automation, enhancing efficiency, decision-making, and scalability in businesses. It explores trends like generative AI, AutoML, data governance, and democratization while providing real-world applications across various industries, ultimately guiding businesses in effective AI/ML integration.

Understanding Data Governance: A Comprehensive Guide

by Sunny Srinidhi - October 18, 2024

Data governance is a set of practices, policies, and standards that ensure data is managed as an asset in a consistent and reliable manner across an organization. It involves defining who owns the data, who has the right to make decisions about it, and how it can be used. This comprehensive guide aims to shed light on what data governance entails, its importance, how it can be achieved, best practices, and who should be involved in the process. What is Data Governance? Data governance refers to the collection of policies, roles, responsibilities, and procedures that oversee the management of data assets within an organization. It ensures that data is accurate, consistent, accessible, and protected from misuse. The main goal of data governance

Installing Hadoop on Windows 11 with WSL2

Data Science

by Sunny Srinidhi - November 1, 2021

We’ll see how to install and configure Hadoop and its components on Windows 11 running a Linux distro using WSL 1 or 2.

Installing Zsh and Oh-my-zsh on Windows 11 with WSL2

Tech

by Sunny Srinidhi - October 27, 2021

In this post, which is part of a series on setting up Windows 11 and WSL2 for big data work, I install Zsh and Oh-my-zsh and set up aliases.

Lemmatization in Natural Language Processing (NLP) and Machine Learning

Data Science

by Sunny Srinidhi - February 26, 2020

Lemmatization is one of the most common text pre-processing techniques used in Natural Language Processing (NLP) and machine learning in general. If you've already read my post about stemming of words in NLP, you'll know that lemmatization is not that much different. In both stemming and lemmatization, we try to reduce a given word to its root word. The root word is called a stem in the stemming process, and a lemma in the lemmatization process. But there are a few more differences between the two than that. Let's see what those are. How is Lemmatization different from Stemming? In stemming, a part of the word is just chopped off at the tail end to arrive at
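To make the idea of a lemma concrete, here is a minimal Python sketch of lemmatization as a dictionary lookup. The `LEMMAS` table and the `lemmatize` function are hypothetical names made up for illustration; real lemmatizers rely on a full vocabulary and usually on part-of-speech information, so treat this only as a toy under those assumptions.

```python
# Hypothetical, hand-written lookup table (an assumption for illustration).
# A real lemmatizer would use a complete dictionary and part-of-speech tags.
LEMMAS = {
    "better": "good",
    "ran": "run",
    "mice": "mouse",
    "studies": "study",
}


def lemmatize(word: str) -> str:
    """Return the dictionary form (lemma) of a word, or the lowercased word if unknown."""
    key = word.lower()
    return LEMMAS.get(key, key)


if __name__ == "__main__":
    for w in ["Mice", "studies", "better", "table"]:
        print(w, "->", lemmatize(w))
```

Unlike chopping characters off the tail end, the lookup can map "mice" to "mouse" and "better" to "good", which no suffix rule would produce.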

Stemming of words in Natural Language Processing, what is it?

Data Science

by Sunny Srinidhi - February 19, 2020 (updated August 27, 2024)

Stemming is one of the most common data pre-processing operations we do in almost all Natural Language Processing (NLP) projects. If you're new to this space, it is possible that you don't know exactly what it is even though you have come across the word. You might also confuse stemming with lemmatization, which is a similar operation. In this post, we'll see what exactly stemming is, with a few examples here and there. I hope I'll be able to explain this process in simple words for you. Stemming: To put it simply, stemming is the process of removing a part of a word, or reducing a word to its stem or root. This might not necessarily mean we're reducing a word
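As a rough illustration of "removing a part of a word", the sketch below strips a few common English suffixes. The suffix list, the minimum stem length of three characters, and the name `crude_stem` are all assumptions chosen for this example; it is not the Porter stemmer or any library's algorithm.

```python
# Suffixes to strip, checked in this order (an illustrative assumption).
SUFFIXES = ("ing", "ed", "es", "s")


def crude_stem(word: str) -> str:
    """Chop the first matching suffix off the end of the word.

    The stem must keep at least three characters, so short words
    such as "sing" are left untouched.
    """
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


if __name__ == "__main__":
    for w in ["jumping", "jumped", "jumps", "running", "sing"]:
        print(w, "->", crude_stem(w))
```

In this toy version "jumping", "jumped", and "jumps" all collapse to "jump", while "running" becomes "runn", a stem that is not itself a dictionary word.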

Removing stop words in Java as part of data cleaning in Artificial Intelligence

Data Science

by Sunny Srinidhi - February 5, 2020

Working with text datasets is very common in data science problems. A good example of this is sentiment analysis, where you get social network posts as data sets. Based on the content of these posts, you need to estimate the sentiment around a topic of interest. When we're working with text as the data, there are several steps we apply to "clean" it, such as normalising, removing stop words, stemming, lemmatizing, etc. In this post, we'll see how we can remove stop words from our input text to clean our data so that our analysis is based only on the actual content of the data. But wait, what are stop
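The post itself works in Java; the sketch below shows the same idea in Python purely as an illustration. The `STOP_WORDS` set is a small hand-picked assumption (real projects typically load a much longer list), and `remove_stop_words` is a hypothetical helper that splits on whitespace only, with no punctuation handling.

```python
# A tiny, hand-picked stop word list (an assumption for illustration).
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "to", "and", "in", "it"}


def remove_stop_words(text: str) -> list[str]:
    """Lowercase the text, split on whitespace, and drop stop words."""
    tokens = text.lower().split()
    return [token for token in tokens if token not in STOP_WORDS]


if __name__ == "__main__":
    post = "The battery life of the phone is great"
    print(remove_stop_words(post))
    # ['battery', 'life', 'phone', 'great']
```

Only the words that carry content remain, which is what a later sentiment analysis step would work on.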