Python, OCaml, and Machine Learning

with Laurent Mazare

Episode 5 | October 7th, 2020

A conversation with Laurent Mazare about how your choice of programming language interacts with the kind of work you do, and in particular about the tradeoffs between Python and OCaml when doing machine learning and data analysis. Ron and Laurent discuss the tradeoffs between working in a text editor and a Jupyter Notebook, the importance of visualization and interactivity, how tools and practices vary between language ecosystems, and how language features like borrow-checking in Rust and ref-counting in Swift and Python can make machine learning easier.

Laurent

Thank you for having me.

01:05:35

Ron

You can find links to some of the things that we talked about, including some of Laurent’s open source work, as well as a full transcript of the episode, along with a glossary, at signalsandthreads.com. Thanks for joining us, and see you next week.

automatic differentiation

A set of techniques for numerically evaluating the derivative of a function specified in a program.
binding

Or, language binding. An application programming interface that allows one programming language to use a library from another programming language.
cycle (garbage collection)

When two or more objects refer to each other in memory, they are said to form a cycle.
duck typing

A form of type checking that uses the presence of certain methods on and properties of an object to determine its suitability for a particular purposes.
dynamic typing

Or, dynamic type checking. The process of verifying a programs' type safety at runtime.
garbage collector

The principal actor in a form of automatic memory management.
GPU

Graphics Processing Unit, a special processor optimized for computer graphics.
linear type

A type that ensures an object is used only once, meaning its memory can be safely freed after use.
LLVM

Low Level Virtual Machine, the compiler framework used by the Swift language (among other languages).
MatPlotLib

A plotting library for Python
NumPy

A mathematical library for Python supporting large matrices and high-level functions.
pandas

A Python data manipulation library, focused on data structures for manipulating tables and time series.
pure functions

A function that will always return the same value for given arguments, and has no side effects.
PyTorch

An open-source Python library for machine learning based on the Torch library.
reinforcement learning

A machine learning paradigm concerned with building models by looking at how an agent responds to rewards and state changes with every action.
static typing

Or, static type checking. The process of verifying a program's type safety properties by analysis of the source code.
strongly typed

A language can be said to be strongly typed if it has strict typing rules at compile time.
TensorFlow

An open-source Python library for dataflow and differentiable programming, primarily used for machine learning purposes.
TPU

Tensor Processing Unit, a special processor developed by Google for neural network machine learning, further optimized for use with TensorFlow.
type safety

The extent to which a programming language prevents or discourages type errors.

Listen and subscribe:

Python, OCaml, and Machine Learning

with Laurent Mazare

Episode 5 | October 7th, 2020

00:00:03.6

Ron

00:01:25.6

Laurent

00:03:54.6

Ron

00:04:00.1

Laurent

00:05:17.7

Ron

00:05:36.1

Laurent

00:07:30.3

Ron

00:07:30.3

Laurent

00:08:50.7

Ron

00:09:21.4

Laurent

00:11:11.8

Ron

00:11:58.9

Laurent

00:12:56.7

Ron

00:14:45.5

Laurent

00:15:29.2

Ron

00:16:27.2

Laurent

00:17:51.0

Ron

00:19:16.7

Laurent

00:20:08.2

Ron

00:21:16.0

Laurent

00:23:39.8

Ron

00:24:24.1

Laurent

00:25:54.5

Ron

00:26:07.9

Laurent

00:27:43.0

Ron

00:27:59.9

Laurent

00:30:00.5

Ron

00:30:19.2

Laurent

00:30:46.5

Ron

00:32:12.1

Laurent

00:34:23.7

Ron

00:34:48.4

Laurent

00:35:30.7

Ron

00:36:42.8

Laurent

00:36:54.4

Ron

00:37:44.2

Laurent

00:38:49.5

Ron

00:40:17.7

Laurent