INDEX
Explanations
first-person singular pronouns and verbs
expressions of regret or self-reflection
New Auto-Interp
Negative Logits
ewitness
-0.64
repud
-0.60
ceivable
-0.59
pastoral
-0.58
pestic
-0.58
stewards
-0.57
corros
-0.57
Accessed
-0.57
credential
-0.57
valued
-0.57
POSITIVE LOGITS
haha
1.01
Anyway
0.98
laughs
0.92
kinda
0.92
yeah
0.89
:(
0.88
anyways
0.86
ya
0.85
XD
0.82
OIL
0.80
Activations Density 0.828%