Skip to content
Thursday, February 25
  • About Me
  • Must Watch Videos
  • Proof of Concepts (POCs)

The Tech Check

The Tech Check

  • Data Science
  • Tech
  • General
  • Proof of Concepts (POCs)
  • About Me / Products
  • Must Watch Videos
  • Data Science
  • Tech
  • General
  • Proof of Concepts (POCs)
  • About Me / Products
  • Must Watch Videos
Trending Now
  • I made a website which tells if you’re wearing a mask or not – without machine learning
  • Free apps vs. Paid apps
  • Binary Search Tree Implementation in Java
  • Different ways of iterating on a HashMap in Java
  • The art of load balancing – Part 2
  • The art of load balancing – Part 1 (Understanding a load balancer)
Home>>bigdata

Tag: bigdata

statistics-on-laptop
Data Science

Null Hypothesis and the P-Value

Sunny SrinidhiNovember 8, 2019 1873 Views5

When you're starting your machine learning journey, you'll come across null hypothesis and the p-value. At a certain point in your journey, it becomes quite important to know what these mean to make ...

Read More
scikit_learn_logo
Data Science

Fit vs. Transform in SciKit libraries for Machine Learning

Sunny SrinidhiNovember 7, 2019 1389 Views0

We have seen methods such as fit(), transform(), and fit_transform() in a lot of SciKit's libraries. And almost all tutorials, including the ones I've written, only tell you to just use one of these ...

Read More
apache_kafka_streams
Data ScienceTech

Apache Kafka Streams and Tables, the stream-table duality

Sunny SrinidhiOctober 1, 2019 2248 Views0

In the previous post, we tried to understand the basics of Apache's Kafka Streams. In this post, we'll build on that knowledge and see how Kafka Streams can be used both as streams and tables. St...

Read More
Amazon Kinesis Firehose
Data ScienceTech

Put data to Amazon Kinesis Firehose delivery stream using Spring Boot

Sunny SrinidhiSeptember 26, 2019 2191 Views1

If you work with streams of big data which have to be collected, transformed, and analysed, you for sure would have heard of Amazon Kinesis Firehose. It is an AWS service used to load streams of data...

Read More
amazon athena
Data ScienceTech

How to Query Athena from a Spring Boot application?

Sunny SrinidhiSeptember 25, 2019 2419 Views0

In the last post, we saw how to query data from S3 using Amazon Athena in the AWS Console. But querying from the Console itself if very limited. We can't really do much with the data, and anytime we ...

Read More
amazon athena
Data ScienceTech

Query data from S3 files using Amazon Athena

Sunny SrinidhiSeptember 24, 2019 3459 Views1

Amazon Athena is defined as "an interactive query service that makes it easy to analyze data directly in Amazon Simple Storage Service (Amazon S3) using standard SQL." So, it's another SQL query engi...

Read More
Spark
Data ScienceTech

Apache Drill vs. Apache Spark – Which SQL query engine is better for you?

Sunny SrinidhiSeptember 23, 2019 2071 Views0

If you are in the big data or data science or BI space, you might have heard about Apache Spark. A few of you might have also heard about Apache Drill, and a tiny bit of you might have actually worke...

Read More
apache_spark
Data ScienceTech

Apache Spark SQL User Defined Function (UDF) POC in Java

Sunny SrinidhiMay 14, 2019 2867 Views2

If you’ve worked with Spark SQL, you might have come across the concept of User Defined Functions (UDFs). As the name suggests, it’s a feature where you define a function, pretty straight forward...

Read More
Sunny Srinidhi's DEV Community Profile
AWS_Community_Builder

Follow Us

  • Twitter
  • LinkedIn
  • Medium
  • GitHub

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 20 other subscribers

Recent Posts

  • I made a website which tells if you’re wearing a mask or not – without machine learning
  • Free apps vs. Paid apps
  • Binary Search Tree Implementation in Java
  • Different ways of iterating on a HashMap in Java
  • The art of load balancing – Part 2

Categories

  • Data Science (41)
  • General (4)
  • Rants (6)
  • Smartphones (1)
  • Tech (71)

Archives

  • January 2021
  • December 2020
  • October 2020
  • August 2020
  • July 2020
  • June 2020
  • May 2020
  • April 2020
  • March 2020
  • February 2020
  • January 2020
  • December 2019
  • November 2019
  • October 2019
  • September 2019
  • June 2019
  • May 2019
  • April 2019
  • November 2018
  • August 2018
  • July 2018
  • August 2017
  • July 2017
  • June 2017
  • April 2017
  • March 2017
  • February 2017
  • January 2017
  • September 2016
  • August 2016
  • March 2016

Tags

ai amazon apache apache kafka apache spark artificial intelligence aws best practices big data bigdata coding data science datascience data structure implementation in java data structures feature reduction feature selection java java data structures java data structures implementation java linked list example java linked list implementation javascript kafka linkedlist linked list in java linked lists machine learning machine learning models ml natural language processing nlp php programming python scikit python sklearn rants scikit scikit learn sklearn spring spring boot tech technology the fasttext series
Sunny Srinidhi | WordPress Theme Ultra Seven