Favio Vazquez, Gregory Piatetsky

Recent:
  • Unknown
Past:
  • Gregory Piatetsky
  • Towards Data Science

Past articles by Favio:

The Data Fabric for Machine Learning – Part 2: Building a Knowledge-Graph

By Favio Vazquez, Founder at Ciencia y Datos. Introduction In the last articles of the series: I’ve been talking about the data fabric in general, and giving some concepts of Machine Learning and Deep Learning in the data fabric. And also gave my definition of the data fabric: > The Data Fabric is the platform that supports all the data in the company. How it’s managed, described, combined… → Read More

The Data Fabric for Machine Learning Part 1-b – Deep Learning on Graphs

By Favio Vazquez, Founder at Ciencia y Datos. > Disclaimer: This is not the second part of the past article on the subject; it’s a continuation of the first part putting the emphasis on deep learning. Introduction We are in the process of defining a new way of doing machine learning, focusing on a new paradigm, the data fabric. In the past article I gave my new definition of machine learning: >… → Read More

The Whole Data Science World in Your Hands

By Favio Vazquez, Founder at Ciencia y Datos. I’ve been looking for years for a platform where I can run my data science projects without the pain of installations and filling my computer with dozens of different tools and environments. Luckily I found that MatrixDS has all of that and more for free! In this article I’ll be testing almost all the tools they have so you don’t have to. If you want… → Read More

Analyzing Tweets with NLP in Minutes with Spark, Optimus and Twint

By Favio Vazquez, Founder at Ciencia y Datos. Introduction If you are here it’s likely that you are interested in analyzing tweets (or something similar) and you have a lot of them, or can get them. One of the most annoying things about that is getting a Twitter application, getting the authentication and all of that. And then if you are using Pandas, there’s no way to scale that. So what about a… → Read More

The Data Fabric for Machine Learning – Part 1

By Favio Vazquez, Founder at Ciencia y Datos. Read part 1-b: Deep Learning on the data fabric here: Introduction If you search for machine learning online you’ll find around 2,050,000,000 results. Yeah for real. It’s not easy to find that description or definition that fits every use or case, but there are amazing ones. Here I’ll propose a different definition of machine learning, focusing on a… → Read More

What’s Going to Happen this Year in the Data World

By Favio Vazquez, Founder at Ciencia y Datos. Introduction If you are immersed in the data world it’s probable that you’ve seen a bunch of articles, blog posts and news about what’s going to happen this and the upcoming years, the trends and expectations. I read lots of them, and if you want to do the same, go to the end of the article and you’ll find them there. But in here I want to give a quick… → Read More

The 3 Biggest Mistakes on Learning Data Science

By Favio Vazquez, Founder at Ciencia y Datos. Hello! It’s me again. I’ve discussed parts of what I’m going to mention here in other articles, but now I want to give a few directions on what’s not data science and how not to learn it. So let’s start with the basics. What is Data Science? > Data science is not just knowing some programming languages, math, statistics and having “domain knowledge”.… → Read More

Data Science with Optimus Part 2: Setting your DataOps Environment

By Favio Vazquez, Founder at Ciencia y Datos. Welcome back to the series of Data Science with Optimus. In the first part: We started this journey talking about Optimus, Spark and creating our environment. For that we are using MatrixDS: To have access to the repo just click below: And on MatrixDS click on Forklift: There’s also a repo on GitHub: You just have to clone it. DataOps From the great… → Read More

Learn How to Listen: One of the hardest parts of being a data scientist

By Favio Vazquez, Founder at Ciencia y Datos. One of the things they didn’t tell you when you started doing Data Science (DS) courses and MOOCs is that a lot of your time (a looottt) will be spent in meetings. These meetings are important. Very important. There you can understand the business, the goals of the area, their KPIs, and what the requirements are for the work they want you to do.… → Read More

2018’s Top 7 Python Libraries for Data Science and AI

By Favio Vazquez, Founder at Ciencia y Datos. > Editor's note: This post covers Favio's selections for the top 7 Python libraries of 2018. Tomorrow's post will cover his top 7 R packages of the year. Introduction If you follow me, you know that this year I started a series called Weekly Digest for Data Science and AI: Python & R, where I highlighted the best libraries, repos, packages,… → Read More

Ontology and Data Science

By Favio Vazquez, Founder at Ciencia y Datos. Photo by Valentin Antonucci from Pexels If you are new to the word ontology don’t worry, I’m going to give a primer on what it is, and then why it matters for the data world. I’ll be explicit in the difference between philosophical ontology and the ontology related to information and data in computer science. Ontology (the philosophical part) In… → Read More

2018’s Top 7 R Packages for Data Science and AI

By Favio Vazquez, Founder at Ciencia y Datos. > Editor's note: This post covers Favio's selections for the top 7 R packages of 2018. Yesterday's post covered his top 7 Python libraries of the year. Introduction If you follow me, you know that this year I started a series called Weekly Digest for Data Science and AI: Python & R, where I highlighted the best libraries, repos, packages, and… → Read More

Deep Learning for the Masses (… and The Semantic Layer)

By Favio Vazquez, Founder at Ciencia y Datos. Introduction to Deep Learning The scope of this article is not introducing Deep Learning, I’ve done that in other articles you can find here: But if you want a taste of it from here this is what I can say to you: > … deep learning is representation learning using different kinds of neural networks [deep neural networks] and optimizing the… → Read More

Why do I Call Myself a Data Scientist?

By Favio Vazquez, Principal Data Scientist at OXXO. A song to enjoy while reading (by The Proclaimers, a word I’ll use a lot here): Data Science, you know that thing that everyone is talking about nowadays, and that almost everyone wants to do right now. It’s not particularly easy to define Data Science as a whole, or subject, I’ve done it in other articles: But here I want to talk about why I… → Read More

Auto-Keras, or How You can Create a Deep Learning Model in 4 Lines of Code

By Favio Vazquez, Principal Data Scientist at OXXO. Automated machine learning is the new kid in town, and it’s here to stay. It is helping us create better and better models with great, easy-to-use APIs. Here I’ll talk to you about Auto-Keras, the new package for AutoML with Keras. There’s a surprise at the end ;). Before starting, a great quote by Matthew Mayo about what AutoML is not:… → Read More

Manage your Machine Learning Lifecycle with MLflow – Part 1

By Favio Vazquez, Principal Data Scientist at OXXO. The Machine Learning Lifecycle Conundrum Machine Learning (ML) is not easy, but creating a good workflow which you can reproduce, revisit and deploy to production is even harder. There have been many advances towards creating a good platform or managing solution for ML. Note that this is not the Data Science (DS) Lifecycle, which is more complex… → Read More

The curse of “intuition” in Data Science

Disclaimer: I’m not talking here as an expert in common sense or intuition. I’m only stating that it does not work all the time with Data Science. We have common sense: a way of seeing and… → Read More

DIY Deep Learning Projects

By Favio Vazquez, Principal Data Scientist at OXXO. LinkedIn Data Science Community Akshay Bahadur is one of the great examples that the Data Science community at LinkedIn gave. There are great people in other platforms like Quora, StackOverflow, Youtube, here, and in lots of forums and platforms helping each other in many areas of science, philosophy, math, language and of course Data Science… → Read More

Deep Learning With Apache Spark: Part 1

By Favio Vazquez, Principal Data Scientist at OXXO. A primer on Apache Spark If you work in the Data World, there’s a good chance that you know what Apache Spark is. If you don’t, that’s ok! I’ll tell you what it is. Apache Spark™. Spark, as defined by its creators, is a fast and general engine for large-scale data processing. The fast part means that it’s faster than previous approaches to work… → Read More

Deep Learning With Apache Spark: Part 2

By Favio Vazquez, Principal Data Scientist at OXXO. By my sister Hi everyone and welcome back to learning :). In this article I’ll continue the discussion on Deep Learning with Apache Spark. You can see the first part here. In this part I will focus entirely on the DL pipelines library and how to use it from scratch. Apache Spark Timeline The continuous improvements on Apache Spark lead us to… → Read More