INDEX
Explanations
references to prior material
New Auto-Interp
Negative Logits
Theſe
-0.99
Мексичка
-0.88
متعلقه
-0.77
hereof
-0.76
foregoing
-0.74
<bos>
-0.73
ſame
-0.72
WireFormatLite
-0.71
RSSSF
-0.71
aforesaid
-0.70
POSITIVE LOGITS
another
0.46
another
0.44
davvero
0.41
vraiment
0.41
outra
0.40
другого
0.40
['
0.39
घ
0.39
fjspx
0.39
всегда
0.38
Activations Density 2.094%