INDEX
Explanations
instances of the word "so" used as discourse markers or conjunctions
New Auto-Interp
Negative Logits
åĨĨ
-0.17
agne
-0.15
udit
-0.15
umont
-0.14
YY
-0.14
dint
-0.14
ettes
-0.14
ÏĦεÏį
-0.14
_simps
-0.14
Cleanup
-0.14
POSITIVE LOGITS
375
0.15
nowhere
0.14
aping
0.14
imd
0.14
ighet
0.13
ÏģÏį
0.13
oldt
0.13
ãģ«ãģĭ
0.13
Ïģί
0.13
bsub
0.13
Activations Density 0.097%