INDEX
Explanations
instances of the word "for" at the end of sentences
phrases indicating something is being sought after
Negative Logits
mort          -0.72
founded       -0.69
roach         -0.68
alone         -0.68
SourceFile    -0.67
edition       -0.67
section       -0.67
oller         -0.66
Democr        -0.65
operated      -0.65
Positive Logits
clues            0.92
WARD             0.91
opportunities    0.80
bidden           0.79
omething         0.78
alternatives     0.77
ways             0.75
gotten           0.75
forgiveness      0.75
loopholes        0.74
Activations Density 0.041%
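
As a rough illustration of how a density figure like the one above could be read, the sketch below assumes that activation density means the fraction of tokens on which the feature's activation is above zero. That definition, the function name `activation_density`, and the sample data are all assumptions made for illustration; they are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up data: 100,000 tokens, of which 41 have a nonzero activation.
sample = [0.0] * 99_959 + [1.0] * 41
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.041% would mean the feature fires on roughly 4 out of every 10,000 tokens, which is consistent with a narrow feature tied to a specific word in a specific position.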