INDEX
Explanations
conjunctions and accompanying phrases that connect ideas or concepts
New Auto-Interp
Negative Logits
寧
-0.14
Sle
-0.14
sat
-0.13
ECTOR
-0.13
anel
-0.13
ANEL
-0.13
á»įt
-0.13
ubb
-0.12
culate
-0.12
nero
-0.12
POSITIVE LOGITS
although
0.24
although
0.21
nowhere
0.18
this
0.18
Although
0.17
whereas
0.17
though
0.17
it
0.16
Although
0.16
it
0.16
Activations Density 0.341%