INDEX
Explanations
statements related to actions or events happening to certain entities
various forms of the verb "to be."
New Auto-Interp
Negative Logits
IMAGES
-0.68
Yourself
-0.67
violates
-0.66
Prepar
-0.64
Unless
-0.64
ãĢı
-0.63
Acquisition
-0.62
Ends
-0.59
Defeat
-0.59
®
-0.59
POSITIVE LOGITS
likewise
1.16
meanwhile
1.09
similarly
0.99
also
0.84
onite
0.80
notably
0.75
attest
0.73
reportedly
0.72
doubtless
0.70
ŃĶ
0.70
Activations Density 0.736%