INDEX
Explanations
the presence of a specific entity or concept repeatedly mentioned in a textual context
Negative Logits
њи          -0.47
n           -0.45
Fordítás    -0.45
achen       -0.45
cras        -0.44
Nazar       -0.44
chiff       -0.44
ály         -0.43
nom         -0.43
an          -0.43
Positive Logits
ne          3.34
NE          2.69
Ne          2.58
Ne          2.54
ne          2.49
NE          2.22
ネ          1.43
neb         1.40
не          1.37
nect        1.33
Activations Density: 0.056%