INDEX
Explanations
phrases indicating strong emotional involvement or significance
recurring themes related to situations and events
Negative Logits
assies    -0.70
eworks    -0.63
events    -0.63
seys      -0.62
giene     -0.61
paces     -0.61
ses       -0.59
Ancients  -0.58
hester    -0.58
Tanks     -0.58
Positive Logits
unto      0.82
worthy    0.78
reminder  0.76
urable    0.76
inel      0.76
less      0.73
akin      0.71
oriented  0.71
ually     0.68
staple    0.68
Activations Density: 0.299%