INDEX
Explanations
instances of direct speech or quotations from individuals
Negative Logits
','');↵    -0.16
ares       -0.14
uset       -0.14
eland      -0.14
еле        -0.13
æĪ·        -0.13
asic       -0.13
559        -0.13
amine      -0.13
nm         -0.13
Positive Logits
Cent       0.15
ÛĮÙĨÙĩ     0.14
reff       0.14
Ø®ÙĬ       0.14
ikal       0.14
olini      0.14
áÄį        0.14
ackbar     0.14
prov       0.13
_FMT       0.13
Activations Density: 0.036%