INDEX
Explanations
references to specific individuals named Nick
mentions of the name "Nick."
New Auto-Interp
Negative Logits
ãĥ´ãĤ¡
-0.74
ASED
-0.74
redistributed
-0.73
20439
-0.72
velt
-0.69
WAYS
-0.69
à¨
-0.69
fare
-0.68
emale
-0.67
ACTED
-0.66
POSITIVE LOGITS
laus
1.31
named
1.11
ety
1.07
names
0.98
olas
0.98
imus
0.93
las
0.92
erson
0.91
ophon
0.90
y
0.87
Activations Density 0.014%