INDEX
Explanations
the word "premise" in various contexts
instances of the word "premise."
New Auto-Interp
Negative Logits
sung
-0.93
vor
-0.87
eday
-0.82
icer
-0.72
eder
-0.66
vals
-0.65
ese
-0.64
NetMessage
-0.64
pedia
-0.63
erers
-0.63
POSITIVE LOGITS
premise
1.21
premises
0.88
REC
0.70
IRE
0.69
proposition
0.67
itial
0.66
Budapest
0.66
OUND
0.65
ulhu
0.65
ALLY
0.64
Activations Density 0.019%