INDEX
Explanations
numerical data and associated units or percentages
New Auto-Interp
Negative Logits
5
-0.58
dorf
-0.58
pied
-0.57
poch
-0.57
kut
-0.56
9
-0.56
inne
-0.56
sotto
-0.56
otten
-0.54
8
-0.54
POSITIVE LOGITS
SBATCH
0.71
ైన
0.68
vidare
0.67
ื่อน
0.64
perry
0.63
kuuta
0.63
član
0.62
religieuses
0.62
SdkVersion
0.62
텝
0.61
Activations Density 0.296%