INDEX
Explanations
phrases indicating cause and effect
the word "as" used in various contexts conveying comparison or effect
Negative Logits
whatsoever  -0.69
origin      -0.61
âϦ         -0.61
atis        -0.60
gger        -0.57
leans       -0.56
opian       -0.56
abouts      -0.56
regon       -0.55
ALLY        -0.55
Positive Logits
pects       1.16
ymm         1.16
semb        1.13
piring      1.08
ynchronous  1.06
bestos      1.05
phalt       1.01
piration    0.99
such        0.95
semble      0.95
Activations Density: 0.077%