INDEX
Explanations
the word "rebut"
phrases related to the act of being removed or excluded
New Auto-Interp
Negative Logits
SHIP
-0.82
Grayson
-0.65
Carbuncle
-0.64
CONTIN
-0.61
redistributed
-0.61
Jenna
-0.61
attendance
-0.61
Flowers
-0.59
MacArthur
-0.59
Barker
-0.59
POSITIVE LOGITS
ut
4.24
uts
2.48
UT
2.13
utor
1.95
uta
1.92
uti
1.88
utan
1.87
uto
1.84
uter
1.81
utt
1.79
Activations Density 0.011%