INDEX
Explanations
references to individuals or groups in narratives
Negative Logits
Token       Logit
Ing         -0.17
omp         -0.15
aurus       -0.15
زار         -0.14
vang        -0.14
omic        -0.14
LOCKS       -0.14
ilebilir    -0.14
ÈĽi         -0.14
ingen       -0.14
Positive Logits
Token       Logit
tog         0.15
ziej        0.15
wand        0.15
licken      0.15
aln         0.14
codec       0.14
count       0.14
conc        0.14
Woj         0.13
asics       0.13
Activations Density 0.122%
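
The page does not define how the density figure is computed. One common reading is that it is the fraction of sampled token positions on which the feature fires at all. The sketch below illustrates that reading only. The function name `activation_density`, the zero threshold, and the sample values are all hypothetical, chosen so the arithmetic lands on the same percentage; they are not the data behind this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions with a
    nonzero (above-threshold) activation. The source does not confirm this.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 122 of them active.
sample = [0.0] * 99_878 + [1.5] * 122
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density near 0.1% would mean the feature is active on roughly one token in a thousand, which is typical of a sparse feature.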