INDEX
Explanations
mentions of physical descriptions of individuals, particularly focusing on their appearance
information related to ancestry and physical characteristics
Negative Logits
Token       Logit
takedown    -0.79
uer         -0.70
uers        -0.68
progress    -0.67
escal       -0.66
evaluates   -0.66
casters     -0.65
ousse       -0.64
ior         -0.64
olving      -0.64
Positive Logits
Token       Logit
Caucasian   0.98
surname     0.92
Caucas      0.85
aunt        0.81
surn        0.80
eldest      0.79
Gujar       0.77
Cherokee    0.76
nephew      0.76
married     0.76
Activations Density: 0.398%