Enhancing Data Security and Privacy in the Cloud with AWS Clean Rooms Data Science by Sunny Srinidhi - May 26, 2023May 26, 20230 Data security and privacy are now top priorities for both individuals and corporations in today’s digital world. Organisations are relying more and more on cloud service providers as a result of the growing popularity of cloud computing to store, manage, and process their sensitive data. Read more... “Enhancing Data Security and Privacy in the Cloud with AWS Clean Rooms”
Use Amazon CloudSearch to quickly search through data Tech by Sunny Srinidhi - March 29, 2023March 29, 20230 Most applications today require a search functionality in them to let users search for content easily and quickly. But building that search feature is not a small task. It often requires specialized knowledge and massive compute resources to be able to search through massive amounts of data quickly. Read more... “Use Amazon CloudSearch to quickly search through data”
Apache Spark Optimisation Techniques Data Science by Sunny Srinidhi - February 23, 2023February 23, 20230 Apache Spark is one of the most popular big data processing tools today. It’s used extensively for data sizes small to large. The availability of Spark in more than one programming language makes it a favourite tool for data engineers and data scientists coming from various backgrounds. Read more... “Apache Spark Optimisation Techniques”
Optimising Hive Queries with Tez Query Engine Data Science by Sunny Srinidhi - June 13, 2022June 13, 20220 Hive provides us the option of executing SQL queries with a few different query engines. It ships with the native MapReduce engine. But we can switch that to Tez which has gained popularity since its launch, or we can also use Apache Spark as well. Read more... “Optimising Hive Queries with Tez Query Engine”
Cleaning and Normalizing Data Using AWS Glue DataBrew Data Science by Sunny Srinidhi - January 17, 2022January 17, 20221 A major part of any data pipeline is the cleaning of data. Depending on the project, cleaning data could mean a lot of things. But in most cases, it means normalizing data and bringing data into a format that is accepted within the project. Read more... “Cleaning and Normalizing Data Using AWS Glue DataBrew”
The Dunning-Kruger Effect In Tech Tech by Sunny Srinidhi - November 28, 2021December 18, 20210 This is not the kind of post I usually write on my blog. This is more of a psychology lecture than a how-to tech tutorial. But it’s not completely irrelevant as well, because I’m going to talk about my experience with the Dunning-Kruger effect in tech that I’ve seen over the last decade. Read more... “The Dunning-Kruger Effect In Tech”
Understanding Apache Hive LLAP Data Science by Sunny Srinidhi - November 18, 2021November 18, 20210 Apache Hive is a complex system when you look at it, but once you go looking for more info, it’s more interesting than complex. There are multiple query engines available for Hive, and then there’s LLAP on top of the query engines to make real-time, interactive queries more workable. Read more... “Understanding Apache Hive LLAP”
Installing Hadoop on the new M1 Pro and M1 Max MacBook Pro Data Science by Sunny Srinidhi - November 5, 2021November 5, 20213 In the previous series of posts, I wrote about how to install the complete Hadoop stack on Windows 11 using WSL 2. And now that the new MacBook Pro laptops are available with the brand new M1 Pro and M1 Max SOCs, here’s a guide on how to install the same Hadoop stack on these laptops. Read more... “Installing Hadoop on the new M1 Pro and M1 Max MacBook Pro”
Installing Hadoop on Windows 11 with WSL2 Data Science by Sunny Srinidhi - November 1, 2021November 1, 20213 In the previous post, we saw how to install a Linux distro on Windows 11 using WSL2 and then how to install Zsh and on-my-zsh to make the terminal more customizable. In this post, we’ll see how we can install the complete Hadoop environment on the same Windows 11 machine using WSL. Read more... “Installing Hadoop on Windows 11 with WSL2”
Installing Zsh and Oh-my-zsh on Windows 11 with WSL2 Tech by Sunny Srinidhi - October 27, 2021October 27, 20211 Before we begin, you might ask, why am I writing on something this trivial? I sold off my old MacBook Pro because I’m super excited about the new M1 Pro MacBook Pros. I have pre-ordered one of those and am waiting for it to come. Read more... “Installing Zsh and Oh-my-zsh on Windows 11 with WSL2”