Explanations
phrases indicating a point in time or perspective
Negative Logits
twin            -0.68
twins           -0.63
amaz            -0.61
harm            -0.61
Cities          -0.60
ours            -0.59
lez             -0.58
discrimination  -0.56
warranties      -0.56
subst           -0.56
Positive Logits
cture           0.86
onwards         0.83
onward          0.77
ebin            0.70
abouts          0.70
endment         0.69
ozo             0.63
ajor            0.63
gue             0.62
EStreamFrame    0.61
Activations Density: 0.035%