Enhancing Data Security and Privacy in the Cloud with AWS Clean Rooms Data Science by Sunny Srinidhi - May 26, 2023May 26, 20230 Data security and privacy are now top priorities for both individuals and corporations in today’s digital world. Organisations are relying more and more on cloud service providers as a result of the growing popularity of cloud computing to store, manage, and process their sensitive data. Read more... “Enhancing Data Security and Privacy in the Cloud with AWS Clean Rooms”
Cleaning and Normalizing Data Using AWS Glue DataBrew Data Science by Sunny Srinidhi - January 17, 2022January 17, 20221 A major part of any data pipeline is the cleaning of data. Depending on the project, cleaning data could mean a lot of things. But in most cases, it means normalizing data and bringing data into a format that is accepted within the project. Read more... “Cleaning and Normalizing Data Using AWS Glue DataBrew”
Getting started with Chalice to create AWS Lambdas in Python – Step by Step Tutorial Tech by Sunny Srinidhi - November 14, 2019November 14, 20190 If you’re into serverless stuff, you already know what is AWS Lambda. But if you don’t know, AWS Lambda is a serverless service provided by Amazon where you can create ‘functions’ and deploy them in AWS, which you can run without having any server instances (such as EC2). Read more... “Getting started with Chalice to create AWS Lambdas in Python – Step by Step Tutorial”
Invoke an AWS Lambda Function from another Lambda Function Data Science Tech by Sunny Srinidhi - November 4, 2019November 4, 20190 I recently discovered that you can't invoke more than one Lambda function in AWS for an S3 event, with the same prefix and suffix (or just with the same suffix, which was the issue in my case). So I wanted a way to invoke one Lambda function from another Lambda function. If you're feeling kind of lost, check out the problem statement in my Github project. That could possibly add some context to the problem. If you don't want to go there, I'll try to explain it here again. The Problem and the Requirement In one of our projects, we have a Lambda function which is invoked whenever a text file is uploaded to a particular S3 bucket. The Lambda function takes
How to automatically trigger AWS Lambda functions using CloudWatch Tech by Sunny Srinidhi - November 2, 20190 If you have AWS Lambda functions which need to be triggered periodically, like CRON jobs, there are many ways to achieve this. But I recently discovered a very easy and AWS-way of doing this, which makes life a lot easier. So, there are a lot of ways you can trigger Lambda functions periodically. One of the most common ways I've see people doing this is adding an API Gateway to the Lambda function, and then calling that API periodically as a CRON job from one of the machines in the setup. I actually thought this is how you're supposed do to that. Okay, let me make this clear. I'm not a DevOps guy. I just learn these things as and when
Integrate AWS DynamoDB with Spring Boot Tech by Sunny Srinidhi - June 26, 2019March 12, 20200 Here is another POC to add to the growing list of POCs on my Github profile. Today, we’ll see how to integrate AWS DynamoDB with a Spring Boot application. This is going to be super simple, thanks to the AWS Java SDK and the Spring Data DynamoDB package. Let’s get started then. Dependencies First, as usual, we need to create a Spring Boot project, the dependencies of which look like: <dependencies> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter</artifactId> </dependency> <dependency> <groupId>com.amazonaws</groupId> <artifactId>aws-java-sdk-dynamodb</artifactId> <version>1.11.573</version>
Use Amazon CloudSearch to quickly search through data Tech by Sunny Srinidhi - March 29, 2023March 29, 20230 Most applications today require a search functionality in them to let users search for content easily and quickly. But building that search feature is not a small task. It often requires specialized knowledge and massive compute resources to be able to search through massive amounts of data quickly. Read more... “Use Amazon CloudSearch to quickly search through data”
Getting Started With Apache Airflow Data Science by Sunny Srinidhi - October 11, 2021October 11, 20210 Apache Airflow is another awesome tool that I discovered just recently. Just a couple of months after discovering it, I can’t imagine not using it now. It’s reliable, configurable, and dynamic. Because it’s all driven by code, you can version control it too. Read more... “Getting Started With Apache Airflow”
I made a website which tells if you’re wearing a mask or not – without machine learning Tech by Sunny Srinidhi - January 11, 2021January 11, 20211 Soon after the lockdown, when we all started working from home, there was a sudden increase in the number of people who wanted to write machine learning code. And not just any machine learning code, but one specifically to check if a person is wearing a mask or not. Read more... “I made a website which tells if you’re wearing a mask or not – without machine learning”
Explore your Amazon S3 data online using Filestash Tech by Sunny Srinidhi - April 29, 2020April 29, 20200 Amazon’s S3, or Simple Storage Service, has become one of the most used cloud services today. We use it for all kind of purposes, including but not limited to data lakes, intermediary storage, persistence layer for databases, etc. I know people who use S3 as their personal online storage, as an alternative for services such Google Drive and Dropbox. Read more... “Explore your Amazon S3 data online using Filestash”