INDEX
Explanations
phrases related to a central concept or idea within a given context
mentions of the word "premise."
New Auto-Interp
Negative Logits
vor
-0.84
sung
-0.76
abbling
-0.70
vals
-0.69
eder
-0.68
eded
-0.66
har
-0.65
ocker
-0.64
NetMessage
-0.64
Journals
-0.63
POSITIVE LOGITS
premise
1.42
assumption
0.75
proposition
0.74
REC
0.74
lessly
0.68
underpin
0.68
SourceFile
0.68
premises
0.66
guiActiveUn
0.66
principle
0.65
Activations Density 0.005%