[1.5, -0.4, 7.2, 19.6, 20.2, 1.7, -0.3, 6.9, 19.1, 21.1]
[1.5, -0.4, 7.2, 19.6, 20.2, 1.7, -0.3, 6.9, 19.1, 21.1]
source: @3blue1brown
The vectors are encoding meaning of the things in some way
red
green
blue
red
green
blue
red
green
blue
red
green
blue
red
green
blue
Each number represents how much red, green, or blue is in the color.
This is exactly what a vector embedding is - a sequence of numbers that represents meaning.
red
green
blue
Uses a neural network to learn word associations from a large corpus of text
(it was initially trained by Google with 100 billion words)
Uses the transformer architecture, taking the entire input text into account to modify each word by the surrounding text.
(the basis of all modern machine learning)
accessed through an API
hosted locally
with Data Science Dojo