INDEX
Explanations
phrases related to providing explanations or defining concepts
the word "where" in various contexts, indicating locations or settings
New Auto-Interp
Negative Logits
³³³³³³³³
-0.67
³³³
-0.63
rieve
-0.63
Dance
-0.61
rolet
-0.60
Epidem
-0.60
Rite
-0.58
³³³³³³³³³³³³³³³³
-0.58
Doctrine
-0.58
ME
-0.58
POSITIVE LOGITS
upon
1.57
soever
1.27
abouts
1.05
fore
1.04
owler
0.74
ever
0.73
ipl
0.73
ver
0.72
anooga
0.72
players
0.71
Activations Density 0.067%