INDEX
Explanations
possessives and contractions
New Auto-Interp
Negative Logits
to
1.47
то
1.13
ä
1.05
ме
1.02
er
1.00
л
1.00
č
0.98
ре
0.96
il
0.95
to
0.95
POSITIVE LOGITS
I
1.07
in
0.93
IAN
0.76
ні
0.69
baseHP
0.67
IER
0.66
E
0.66
dependency
0.65
กฎ
0.65
eiusmod
0.64
Activations Density 0.207%