INDEX
Explanations
phrases indicating a contrast or comparison
phrases indicating a lack of freedom or capacity
New Auto-Interp
Negative Logits
lied
-0.82
ussion
-0.76
artisan
-0.75
lication
-0.71
iard
-0.70
teasp
-0.68
got
-0.68
alez
-0.68
iosyn
-0.67
illed
-0.67
POSITIVE LOGITS
afar
0.99
anywhere
0.78
whence
0.73
domin
0.71
scratch
0.70
touching
0.65
Constantinople
0.64
ordinary
0.64
conclusive
0.63
thence
0.62
Activations Density 0.047%