INDEX
Explanations
comparative words indicating varying degrees
instances of the word "way" in various contexts
New Auto-Interp
Negative Logits
tein
-0.76
uster
-0.74
usters
-0.72
prosec
-0.70
encer
-0.68
lict
-0.67
iners
-0.67
livest
-0.67
encers
-0.63
querque
-0.63
POSITIVE LOGITS
fare
1.08
finding
1.00
ward
0.99
point
0.98
points
0.93
forward
0.88
bos
0.83
WARD
0.79
bill
0.77
YY
0.76
Activations Density 0.035%