INDEX
Explanations
contractions of "does not" in text
the word "doesn't" in various contexts
Negative Logits
DRAGON        -0.71
Dise          -0.68
ANG           -0.65
fox           -0.65
Remastered    -0.63
Mant          -0.63
Buff          -0.62
hung          -0.61
Ivory         -0.60
dar           -0.60
Positive Logits
't            1.47
kie           0.93
paces         0.84
berra         0.78
terness       0.76
ettings       0.75
ema           0.74
ajor          0.74
uts           0.73
ates          0.73
Activations Density: 0.038%