INDEX
Explanations
conditional statements indicating possibilities or conditions
conditional phrases that imply specific scenarios or situations
New Auto-Interp
Negative Logits
quit
-0.69
gur
-0.67
hibited
-0.64
activate
-0.63
continue
-0.62
hole
-0.61
ude
-0.60
helial
-0.60
lez
-0.60
aunder
-0.59
POSITIVE LOGITS
you
0.91
accompanied
0.87
coupled
0.85
there
0.83
compared
0.82
paired
0.78
contrasted
0.77
they
0.75
someone
0.74
somebody
0.72
Activations Density 0.144%