INDEX
Explanations
names of individuals, potentially associated with legal or media contexts
mentions of specific individuals, particularly the name "Rosen."
New Auto-Interp
Negative Logits
ŃĶ
-0.99
DERR
-0.82
uate
-0.79
oran
-0.74
è¦ļéĨĴ
-0.73
ĪĴ
-0.71
acular
-0.70
newcom
-0.69
lain
-0.69
į
-0.66
POSITIVE LOGITS
lette
0.93
cker
0.83
heet
0.82
qv
0.81
ball
0.79
vention
0.78
stal
0.78
bern
0.77
BILITIES
0.77
enthal
0.75
Activations Density 0.033%