INDEX
Explanations
phrases related to government and political theories
words that begin with prefixes related to forgiving or forgetting, including forms of the root "forgive" or similar constructs
New Auto-Interp
Negative Logits
rons
-0.47
ries
-0.43
rice
-0.41
clerks
-0.40
instincts
-0.39
Weiner
-0.38
ruling
-0.38
entrances
-0.38
directions
-0.37
result
-0.37
POSITIVE LOGITS
ets
0.61
ibility
0.58
oul
0.55
igning
0.54
eme
0.51
DonaldTrump
0.50
icker
0.50
igned
0.50
pler
0.49
eling
0.49
Activations Density 7.731%