INDEX
Explanations
words prefixed with 'un'
words and phrases that start with "un-"
New Auto-Interp
Negative Logits
Ajax
-0.70
Cree
-0.66
FAR
-0.66
face
-0.65
Tut
-0.65
realism
-0.65
hinge
-0.65
rides
-0.63
fixtures
-0.63
drawer
-0.62
POSITIVE LOGITS
assuming
1.37
cles
1.37
apolog
1.34
ruly
1.34
confirmed
1.32
spoken
1.30
numbered
1.29
character
1.27
anticipated
1.27
occupied
1.27
Activations Density 0.026%