The basics of deep learning

From maths to PyTorch.

Abstract

In the present page, mathematical concepts are explained, oriented to PyTorch.

1. Introduction

The goal of the present page is to teach the basics of supervised deep learning, oriented to using PyTorch. While there is a quick review, a previous basic understanding of probability, linear algebra, statistics and mathematical analysis is needed.

2. Useful resources

The below listed learning resources are all FREE.

For seeing a broad overview of methods and what to learn, check the AI expert roadmap.

2. Maths review

A quick recap of basic maths concepts important for machine learning models.

2.1. Linear algebra

Vectors

Leaving aside the different interpretations of this concept, a vector starts from an origin (where all coordinates intersect), and ends in a point in space, defined by each value of the different dimensions. These values are represented as numbers in an ordered list.

Concretely, for data science, a vector of features stores the different values of a data point, and the vector of weights stores the different values of the parameters of a model.

Matrices

They are a set of vectors, stored by columns.

Operations

All operations on linear algebra arise from basis vectors, which contain a 1 for a coordinate value, and all 0 for the rest.

Linear combinations

u = a v + b w a , b R ; u , v , w R N

When varying a and b values,