INDEX
Explanations
the word "as" and its variations in context
Negative Logits
raud     -0.17
ewis     -0.16
its      -0.16
abal     -0.14
MPU      -0.14
ewise    -0.14
itsu     -0.14
лÑİб     -0.14
اذ       -0.14
elda     -0.14
Positive Logits
ebe          0.16
613          0.15
showc        0.15
andon        0.15
well         0.14
opposed      0.14
utor         0.14
aina         0.14
precedent    0.14
طب           0.14
Activations Density 0.050%