INDEX
Explanations
negations and their context in sentences
New Auto-Interp
Negative Logits
are
-0.67
Groves
-0.67
gasus
-0.66
op
-0.66
cartes
-0.63
dragen
-0.63
OP
-0.61
Lev
-0.61
является
-0.61
Goodman
-0.60
POSITIVE LOGITS
useAppContext
1.06
pleaſure
1.01
"])
1.01
raiſ
1.01
purpoſe
1.00
Anſ
0.98
iſt
0.97
}))
0.95
faſt
0.95
dreamstime
0.93
Activations Density 0.147%