INDEX
Explanations
mentions of a specific person
words related to a specific individual's name or identity
New Auto-Interp
Negative Logits
©¶æ
-0.97
é¾įå
-0.75
ISA
-0.67
conn
-0.66
ACH
-0.66
mine
-0.66
ITNESS
-0.65
merce
-0.64
ISE
-0.64
Downloadha
-0.64
POSITIVE LOGITS
atic
1.32
atically
0.98
atics
0.96
adic
0.76
ity
0.75
contrasts
0.71
atical
0.68
quished
0.67
aneous
0.67
etic
0.66
Activations Density 0.015%