INDEX
Explanations
references to specific names or individuals, particularly "Curry" and "Lopez"
mentions of the names "Curry" and "Lopez."
New Auto-Interp
Negative Logits
ãĥĨãĤ£
-0.76
igor
-0.76
itational
-0.75
rador
-0.73
natureconservancy
-0.71
è¦ļéĨĴ
-0.71
aples
-0.69
tered
-0.67
notes
-0.66
oops
-0.65
POSITIVE LOGITS
Ô
0.78
cember
0.77
Yiannopoulos
0.75
legate
0.73
ĨĴ
0.72
Clicker
0.71
esome
0.69
Kos
0.66
elson
0.65
Trin
0.63
Activations Density 0.034%