Explanations
terms related to language learning and proficiency
Negative Logits
Token         Logit
ambilan       -0.66
findpost      -0.66
bestos        -0.63
désolés       -0.63
negroes       -0.61
VersionUID    -0.60
magasiner     -0.60
berätt        -0.58
ionizing      -0.57
сылкі         -0.56
Positive Logits
Token         Logit
sia           0.50
[blank]       0.50
TET           0.45
server        0.45
border        0.45
str           0.45
dain          0.45
ceus          0.45
Fluent        0.44
Typing        0.44
Activations Density: 0.038%