INDEX
Explanations
phrases related to instructions or warnings
contractions and negative forms of the verb "to do."
New Auto-Interp
Negative Logits
Pegasus
-0.74
Craigslist
-0.72
booked
-0.69
ejected
-0.65
sibling
-0.65
Hercules
-0.63
matured
-0.63
Rhodes
-0.62
wheels
-0.62
booted
-0.61
POSITIVE LOGITS
âĢ
1.96
âĢ
1.46
ãĢ
1.37
̶
1.36
Â
1.35
§
1.34
îĢ
1.34
âĢł
1.33
ÃĥÃĤ
1.30
âĶ
1.29
Activations Density 0.689%