INDEX
Explanations
proper nouns, especially names and organizations
New Auto-Interp
Negative Logits
اÙĦØ«
-0.15
Ỽi
-0.15
ória
-0.15
oodle
-0.15
nton
-0.15
xBF
-0.14
_DETECT
-0.14
unei
-0.14
EventListener
-0.14
atura
-0.14
POSITIVE LOGITS
SG
0.23
AG
0.22
tog
0.21
Lag
0.21
LAG
0.20
agog
0.20
lg
0.19
Kag
0.19
AGR
0.18
LG
0.18
Activations Density 0.183%