INDEX
Explanations
descriptions or narrations of personal experiences or stories
Negative Logits
ortium            -0.42
Cosponsors        -0.40
Revision          -0.36
Econom            -0.35
Historically      -0.34
Journalism        -0.33
Polit             -0.33
Reconstruction    -0.32
nutshell          -0.32
ylum              -0.32
Positive Logits
undet             0.42
whatever          0.39
afterwards        0.38
refill            0.37
eternity          0.37
erection          0.37
snack             0.36
omever            0.36
atever            0.35
whichever         0.35
Activations Density: 37.810%