Explanations
the name "Rubin" at varying activation levels
names of individuals, particularly those with high visibility or influence
Negative Logits
Aboriginal    -0.74
pregn         -0.73
erala         -0.73
Georgian      -0.69
alach         -0.69
calling       -0.66
holding       -0.65
NSW           -0.64
LEASE         -0.63
rights        -0.63
Positive Logits
Rubin         1.00
ophone        0.90
baum          0.90
stein         0.86
icals         0.83
ules          0.82
feld          0.79
ucci          0.79
bach          0.78
ozo           0.77
Activations Density: 0.009%