INDEX
Explanations
phrases indicating causality
the word "so" used as a connector or transition in sentences
Negative Logits
mast          -0.62
Mens          -0.61
ammy          -0.59
inch          -0.59
女            -0.58
Wer           -0.56
Halls         -0.56
kb            -0.55
silhouette    -0.53
Souls         -0.53
Positive Logits
bered         1.22
oner          1.14
othe          1.14
apy           1.12
othes         0.99
aps           0.92
oooo          0.88
arin          0.87
ooo           0.85
oths          0.82
Activations Density: 0.067%