INDEX
Explanations
expressions of personal experience or inner thoughts
first-person singular pronouns indicating personal experiences or feelings
New Auto-Interp
Negative Logits
INGTON
-0.61
Kelvin
-0.58
Philipp
-0.58
Shelby
-0.55
Vald
-0.54
Alternative
-0.54
Ep
-0.53
Jarrett
-0.53
Aberdeen
-0.53
minster
-0.53
POSITIVE LOGITS
'm
1.41
've
1.30
suppose
1.20
'll
1.19
'd
1.08
guess
1.01
ggy
0.96
rises
0.92
RL
0.89
ANA
0.89
Activations Density 0.388%