Blog

Progress in Machine Translation - Pt. 1

Category: NLP
A multi-part post focusing mainly on the build-up towards Neural Machine Translation without any parallel data. Part 1 explores the roots of basic translation and some of the progress made over the years.

Introduction to Word Embeddings - Pt. 1

Category: NLP
Word embeddings are basically vector or tensor representations of text, which can then be used by various Machine Learning algorithms for a variety of tasks. Word2vec, one of the primary word embedding algorithms and the one we will focus on, was proposed almost 4 years ago. Since then, it has exploded, with a paper coming out every day on using Word2vec for semantic classification tasks. It’s exceptionally useful and ingenious. So, why wait? Let’s get straight into it.

Dataset Shortages Pt. 1 - Introduction to Scrapy

Category: Data-Science
Deep Learning and Machine Learning are fun hobbies. Endless projects to do. Lots of Open Source Code available. Nothing to stop you. Except Datasets. If you are working on a generic problem, you will probably come across a dataset if you look hard enough. But if you are trying to solve a weird problem, looking for a dataset can be nightmarish. Fortunately, there are ways you could build your own dataset overnight.

Webdrivers - An Introduction to Selenium

Category: Data-Science
Ever seen a person do too well at an online game? Or maybe spam you with messages at an inhuman speed? Well, this is the kind of thing Webdrivers can do. Webdrivers are easy-to-use APIs for web application testing: they let us control how our browser behaves towards sites through the provided API. People do use them for testing servers, but there are a lot of fun uses too.