INDEX
Explanations
instances of the word "However," indicating a shift or contrast in the text
Negative Logits
resaid       -0.64
neceff       -0.63
createSlice  -0.61
dedans       -0.60
Majefty      -0.60
وانید        -0.59
ftate        -0.58
เลย          -0.56
occafion     -0.56
fhew         -0.55
Positive Logits
demikian       0.80
مرئيه          0.73
że             0.65
&___           0.64
ⓧ              0.62
much           0.60
autorytatywna  0.58
much           0.56
sidemargin     0.56
[]:            0.56
Activations Density 0.079%