INDEX
Explanations
punctuation marks representing certainty or agreement
statements that affirm or assert facts
New Auto-Interp
Negative Logits
wagon
-0.76
dun
-0.73
delinquent
-0.68
mud
-0.67
unseen
-0.65
unob
-0.64
oing
-0.64
tranquil
-0.64
sling
-0.63
ween
-0.63
POSITIVE LOGITS
Especially
1.08
But
1.07
Because
1.07
Unless
1.06
Whereas
1.01
Absolutely
0.99
Certainly
0.98
However
0.98
Though
0.96
Whenever
0.95
Activations Density 0.320%