INDEX
Explanations
people's names along with their associated titles or roles
instances of the verb "was" indicating past experiences or states of being
New Auto-Interp
Negative Logits
eva
-0.78
olutions
-0.78
ado
-0.70
itage
-0.70
ieth
-0.67
]]
-0.66
Needs
-0.64
idium
-0.63
haven
-0.63
Retrieved
-0.63
POSITIVE LOGITS
nicknamed
1.11
fascinated
1.10
obsessed
1.10
able
1.10
known
1.03
proficient
1.02
bullied
1.01
adept
1.01
rumored
0.99
fluent
0.98
Activations Density 0.272%