INDEX
Explanations
comparisons and contrasts between different subjects
New Auto-Interp
Negative Logits
_HI
-0.16
afort
-0.16
;element
-0.15
resi
-0.15
folio
-0.15
fen
-0.15
fol
-0.14
ibar
-0.14
елеÑĦ
-0.14
ante
-0.14
POSITIVE LOGITS
Stuart
0.16
Kem
0.15
elsewhere
0.15
TMPro
0.15
later
0.15
iev
0.15
Lud
0.15
346
0.14
866
0.14
692
0.14
Activations Density 0.202%