INDEX
Explanations
references to money or currency
Negative Logits
Token         Logit
ected         -0.16
ej            -0.16
580           -0.15
ÑĤаб          -0.15
cline         -0.14
itch          -0.14
Mast          -0.14
far           -0.14
crit          -0.14
unofficial    -0.14
Positive Logits
Token         Logit
ined          0.23
inous         0.21
pees          0.20
atoria        0.19
SSERT         0.19
á»ĵi          0.19
pee           0.19
ination       0.19
486           0.18
inations      0.18
Activations Density: 0.013%