INDEX
Explanations
phrases that describe sensations or comparisons
New Auto-Interp
Negative Logits
tran
-0.18
illa
-0.17
ients
-0.15
ón
-0.15
ista
-0.15
pregn
-0.15
aken
-0.14
åī²
-0.14
>\<
-0.14
roids
-0.14
POSITIVE LOGITS
Airways
0.16
ãĤ¹ãĥĨãĤ£
0.15
olley
0.15
tle
0.14
oby
0.14
ISK
0.14
Fat
0.14
Concord
0.13
adle
0.13
aria
0.13
Activations Density 0.023%