INDEX
Explanations
instances of numerical values
New Auto-Interp
Negative Logits
########.
-0.77
]]
-0.74
ГЛА
-0.70
transfieras
-0.69
تقاوى
-0.68
Paglinawan
-0.68
Blok
-0.67
متعلقه
-0.66
Dati
-0.66
]>=
-0.66
POSITIVE LOGITS
२०
0.80
AndEndTag
0.79
২০
0.68
wanzig
0.68
veinte
0.67
0
0.67
coscienza
0.66
entieth
0.65
Ath
0.64
vzduchu
0.64
Activations Density 0.208%