INDEX
Explanations
contractions ('aren't', 'won't', 'isn't', etc.)
negations or contractions related to the state of being
New Auto-Interp
Negative Logits
anwhile
-0.86
enthusi
-0.83
destro
-0.78
princ
-0.78
eleph
-0.76
gobl
-0.76
newcom
-0.74
unnecess
-0.69
exha
-0.69
metic
-0.68
POSITIVE LOGITS
't
1.73
ited
0.88
ÃŃ
0.88
´
0.87
iting
0.86
itely
0.85
atically
0.83
Dispatch
0.80
uts
0.79
acio
0.79
Activations Density 0.082%