Explanations
phrases discussing efficacy and assessments of policies or treatments
Comes after "while"
while X is/has
Negative Logits
either      -0.80
Either      -0.79
even        -0.78
either      -0.77
Either      -0.77
even        -0.71
prostu      -0.65
inoltre     -0.65
invece      -0.65
entweder    -0.65
Positive Logits
technically     1.31
nominally       1.17
ostensibly      1.08
outwardly       1.03
admittedly      1.00
theoretically   0.99
superfic        0.96
initially       0.92
may             0.91
téc             0.88
Activations Density: 0.504%