Explanations
statements of existence or propositions involving 'is' to indicate affirmation or certainty
Negative Logits
Token        Logit
belong       -0.59
encodeWith   -0.57
belonging    -0.50
принадле     -0.46
belong       -0.46
Belong       -0.43
belongs      -0.41
baz          -0.40
もので         -0.40
belongs      -0.40
Positive Logits
Token          Logit
true           0.90
reflected      0.79
why            0.79
untrue         0.75
evidenced      0.73
corroborated   0.70
true           0.68
exemplified    0.68
truer          0.67
especially     0.65
Activations Density: 0.481%
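For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed. It assumes density means the percentage of sampled tokens on which the feature's activation is above zero. The page itself does not state the definition, so treat this reading as an assumption. The function name `activation_density` and the sample data are hypothetical and purely illustrative.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: "density" is the share of tokens on which the feature fires.
    """
    values = list(activations)
    if not values:
        return 0.0
    active = sum(1 for a in values if a > threshold)
    return 100.0 * active / len(values)


# Hypothetical per-token activations: 5 active tokens out of 1000.
sample = [0.0] * 995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.481% would mean the feature fires on roughly 5 of every 1000 sampled tokens.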