Age | Commit message (Collapse) | Author |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Vaile reads code
|
|
|
|
|
|
|
|
|
|
add limited supervision training (10hr)
|
|
|
|
Plot
|
|
|
|
|
|
|
|
|
|
Refactor modularize
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Tokenizer
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Not distributed but still cool
|
|
|
|
|
|
For now decided against it, so I will overwrite the changes soon.
|
|
Slurm distributor attempt one
|
|
|
|
|
|
|
|
|
|
|
|
| ||
|| |_
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Feature dataloading
|
|
|
|
|
|
|
|
Pherkel decided he only wants 3.10..
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|