INDEX
Explanations
personal pronouns referring to individuals or groups
New Auto-Interp
Negative Logits
pregn
-0.65
dstg
-0.65
worldly
-0.63
natureconservancy
-0.60
Untitled
-0.60
duc
-0.59
causation
-0.57
guiActiveUnfocused
-0.56
CAP
-0.56
$_
-0.56
POSITIVE LOGITS
also
1.19
furthermore
1.12
pherd
0.99
additionally
0.98
therefore
0.97
moreover
0.92
also
0.92
zbollah
0.90
resy
0.88
miah
0.86
Activations Density 1.347%