INDEX
Explanations
the word "though" in various contexts
New Auto-Interp
Negative Logits
XNUMX
-1.03
pleaſure
-0.90
Efq
-0.88
NUMX
-0.87
PLW
-0.86
Balt
-0.83
Majefty
-0.81
hematical
-0.81
myſelf
-0.81
PDC
-0.79
POSITIVE LOGITS
though
1.60
THOUGH
1.44
Though
1.42
Though
1.37
though
1.36
tho
1.12
through
1.01
Through
0.95
through
0.94
Tho
0.94
Activations Density 0.058%