INDEX
Explanations
proper nouns and titles of individuals
phrases related to political figures and their actions or attributes
New Auto-Interp
Negative Logits
odge
-0.67
irie
-0.67
aways
-0.66
necessities
-0.66
bedrooms
-0.66
EMS
-0.63
cohesion
-0.63
tents
-0.62
urry
-0.62
preliminary
-0.61
POSITIVE LOGITS
who
1.61
whom
1.52
whose
1.43
who
1.42
whose
1.27
himself
1.22
Himself
1.19
someone
1.15
extraord
1.07
aka
1.07
Activations Density 0.567%