INDEX
Explanations
phrases ending with a period and quotes
periods at the end of sentences
New Auto-Interp
Negative Logits
akia
-0.81
ributes
-0.70
retreat
-0.68
uder
-0.68
hemer
-0.67
aper
-0.66
arest
-0.66
pill
-0.63
reci
-0.63
apers
-0.63
POSITIVE LOGITS
Reviewer
0.98
Flavoring
0.96
Needless
0.89
Lastly
0.87
Whereas
0.84
Conversely
0.82
Ultimately
0.81
Eventually
0.81
fixme
0.81
Alas
0.81
Activations Density 0.165%