INDEX
Explanations
first person singular and plural pronouns with words related to personal actions and experiences
personal pronouns and references to individual experiences or actions
New Auto-Interp
Negative Logits
rontal
-0.77
heit
-0.64
blooded
-0.62
targ
-0.61
livest
-0.60
entric
-0.60
enthusi
-0.59
puter
-0.59
Americ
-0.59
Travels
-0.58
POSITIVE LOGITS
Learned
0.96
lacks
0.87
lacked
0.86
REALLY
0.82
really
0.82
Means
0.80
uncovered
0.80
really
0.80
missing
0.79
mean
0.77
Activations Density 0.092%