Analyse Text and CSVs easily with DataBasic.io

Jessica YungData ScienceLeave a Comment

In today’s post I’m going to explore a fun data analysis tool for beginners – DataBasic.io. I had a great time trying it out – you should have a go too! DataBasic performs simple but insightful operations on data. No technical expertise (beyond being able to navigate a webpage) is required. DataBasic comprises three tools to help you understand textual and … Read More

Machine Learning in Trading – Project Takeaways

Jessica YungData Science1 Comment

People have used machine learning in trading for decades. Hedge funds, high-frequency trading shops and sole traders use all sorts of strategies, from Bayesian statistics to physics related strategies. In my final project for Udacity’s Machine Learning Nanodegree, I investigating using machine learning in trading stocks, specifically to predict British Petroleum (BP) stock prices on the London Stock Exchange (LSE) … Read More

Dennis Mortensen, Part 1: Frameworks for looking at the AI Market

Jessica YungData Science, Talk Reviews2 Comments

dennis-mortensen

Last weekend (24th-25th Sept) was the second Ai.WithTheBest conference – an online conference about artificial intelligence. Over two days, speakers gave talks (often from their homes) with live Q&A. Dennis Mortensen of x.ai kicked off this year’s conference with three frameworks for looking at AI products. He gave two frameworks for looking at the AI market and one framework for … Read More

Questions to ask when deciding how to approach predictive problems

Jessica YungData ScienceLeave a Comment

Is the situation stochastic or deterministic? Is it time-inhomogeneous? (Different across time?) How much data do you have available? What limitations are there with respect to computational cost (compute and time), both for training and predicting? Do you need to try actions to learn about situations? (If so, consider Reinforcement Learning.) Do your actions have an impact on the environment? … Read More

Udacity Connect Review (London)

Jessica YungData ScienceLeave a Comment

Udacity Connect are in-person meet-ups to supplement Udacity’s Nanodegrees, online certifications that consist of a series of courses and graded projects. (Udacity is an online educational organisation that offers technology-centred online courses.) After piloting in the US over the summer, Udacity Connect launched in London last week. In this post I describe what happened at the second(my first) Machine Learning … Read More

What data does Facebook.com load?

Jessica YungData Science, HighlightsLeave a Comment

Today we’re going to look at your Facebook homepage’s source code. This is interesting because it gives you an idea of the data Facebook is using every time you load your Newsfeed and accompanying ticker and chat windows, what exactly this data is, and how this data is stored and formatted. Here’s an example: (Scroll down for step-by-step instructions on … Read More

Removing Outliers from your Data

Jessica YungData ScienceLeave a Comment

Hastily compiled, from uDacity’s Intro to Machine Learning videos. Here’s a general recipe for removing outliers from your data: 1. Train with all data. 2. Remove ~10% of data (points with highest residual error). 3. Train again. Obviously don’t remove outliers blindly – sometimes they are important and you should pay attention to them. But outliers that are results of … Read More