INDEX
Explanations
references to the word "ign" in varying contexts
instances of the substring "ign."
Negative Logits
bian          -0.78
HAHAHAHA      -0.72
Springfield   -0.68
cham          -0.67
------------------------------------------------  -0.67
Span          -0.65
Bie           -0.65
————————      -0.64
Gingrich      -0.63
successfully  -0.61
Positive Logits
ments      1.18
ificant    1.05
antly      1.05
mentation  0.99
entials    0.98
eous       0.95
ame        0.94
atories    0.90
atures     0.90
iew        0.89
Activations Density 0.013%