INDEX
Explanations
proper names, specifically related to individuals and locations
New Auto-Interp
Negative Logits
_IMPLEMENT
-0.16
oose
-0.15
kova
-0.15
NR
-0.15
pylint
-0.14
alfa
-0.14
tie
-0.14
args
-0.14
áb
-0.14
CHA
-0.13
POSITIVE LOGITS
Mell
0.17
eo
0.15
éĺª
0.14
conj
0.14
Vere
0.14
mani
0.14
ermo
0.14
meli
0.14
orno
0.13
.scenes
0.13
Activations Density 0.023%