Removing stop words in Java as part of data cleaning in Artificial IntelligenceData Science by Sunny Srinidhi - February 5, 2020February 5, 20200 More in The fastText Series. Working with text datasets is very common in data science problems. A good example of this is sentiment analysis, where you get social network posts as data sets. Based on the content of these posts, you need to estimate the sentiment around a topic of interest. When we're working with text as the data, there are a lot of words which we want to remove from the data to "clean" it, such as normalising, removing stop words, stemming, lemmatizing, etc. In this post, we'll see how we can remove stop words from our input text to clean our data so that our analysis is based only on the actual content of the data. But wait, what are stop
Descriptive and Inferential statistics – the two types of statisticsData Science by Sunny Srinidhi - January 30, 2020January 30, 20200 In this post, we’re looking at two major types of statistics – descriptive statistics and inferential statistics.
Binary Tree Implementation in JavaTech by Sunny Srinidhi - January 27, 2020January 27, 20201 In this post, we will see how we can implement a binary tree in Java. We will be using Double Linked List internally for this implementation.
Sorting in MongoDB in Java using BasicDBObjectTech by Sunny Srinidhi - January 24, 2020January 24, 20200 In this post, we’ll see how we can write a sort query for MongoDB in Java using the BasicDBObject class. I’ll use Spring Boot for this.
Emulating Apache Kafka with Amazon SNS and SQSTech by Sunny Srinidhi - January 22, 2020January 24, 20200 We’ll learn how to introduce the concept of consumer groups from Kafka in the AWS world using Amazon SNS and Amazon SQS.
Publishing messages to Amazon SNS from a Spring Boot applicationTech by Sunny Srinidhi - January 20, 2020January 24, 20200 We’ll learn how we can publish messages to an SNS topic from a Spring Boot application. This can be done from any Java code or framework.
Receiving messages from Amazon SQS in a Spring Boot applicationTech by Sunny Srinidhi - January 16, 2020January 24, 20200 In this post, we'll see how we can receive messages from an Amazon SQS queue in a Spring Boot application. This is a continuation of the previous post where we talked about how we can send messages to an SQS queue. The obvious next part of that is how do we receive those messages. So in this post, we'll do just that. If you don't have an Amazon SQS queue created already, checkout the previous post on how to do it. Here, I'll assume that you already have that pipeline setup. So I'm going to skip that part of the post. We'll jump right into the code. The Code The first thing we need to add in our Spring Boot application is the
Sending messages to Amazon SQS from a Spring Boot applicationTech by Sunny Srinidhi - January 14, 2020January 24, 20203 We're looking at yet another proof of concept (POC) application today. We're going to see how we can integrate Amazon SQS, which stands for Simple Queue Service into our Spring Boot application so that we can send messages to the queue. I'm going to use a few terms in this post which are influenced by Apache Kafka, because I come with extensive Kafka experience. However, I'm not going to compare Apache Kafka and Amazon SQS here. To clear things up, any service which sends a message to an SQS queue, I'll refer to such a service as the producer. And any service which receives a message from an SQS queue, I'll refer to that as the consumer. Now that we
Circular Double Linked List Implementation in JavaTech by Sunny Srinidhi - January 10, 2020January 16, 20200 More in The Data Structures series. We'll continue our data structures journey with this post about how to implement a circular Double Linked List (DLL) in Java. This is very similar to the standard DLL with the only difference being the connection of the head with the tail. That means, we link the head the tail to each other, which we can visualise as a circle, because a circle has no start and no end. Because the head and the tail of the list are connected to each other, we can say that there is no start and no end. But of course, we have references to both the head and the tail, to make our traversal easy. If you have not
Using Google’s libphonenumber Library to Parse and Validate Phone NumbersTech by Sunny Srinidhi - January 9, 2020January 9, 20200 We all work with phone numbers in almost any project or product which has human users. And when the product is available to a global user base, it becomes very difficult to maintain valid phone numbers in the database. We need to make sure the phone numbers for different regions are of the proper length for their regions, add country codes, or remove them, and a lot of such validations. This could become a project of its own pretty soon. We had such an issue in one of our projects. When I was doing the research to find an easy to use and light weight tool so that I could outsource the smarts involved in this to, I came across the