INDEX
Explanations
information related to personal attributes or descriptions of individuals
the phrase "who is" followed by a description or status of individuals
Negative Logits
atche       -0.79
ivable      -0.67
phis        -0.66
efined      -0.66
ometers     -0.65
lations     -0.65
heastern    -0.65
Topics      -0.64
reat        -0.64
sers        -0.64
Positive Logits
overseeing  0.86
studying    0.83
stationed   0.81
married     0.80
unmarried   0.79
fluent      0.79
divorced    0.79
suing       0.79
autistic    0.79
divor       0.78
Activations Density: 0.132%