INDEX
Explanations
terms related to physical objects or activities
New Auto-Interp
Negative Logits
teen
-0.24
teenth
-0.20
cut
-0.17
tes
-0.17
fully
-0.16
ta
-0.16
-ÑĤаки
-0.16
amba
-0.16
eens
-0.16
tery
-0.15
POSITIVE LOGITS
jamin
0.23
forth
0.22
issance
0.20
egal
0.20
ultimate
0.20
ial
0.19
/disable
0.18
igma
0.17
folk
0.17
à§įà¦
0.16
Activations Density 0.278%