INDEX
Explanations
the word "So" in the text
instances of the word "so" used to connect or introduce statements
New Auto-Interp
Negative Logits
ropolitan
-0.64
royalty
-0.61
tongue
-0.59
rhy
-0.59
"],"
-0.59
realities
-0.57
territory
-0.57
enment
-0.57
inch
-0.56
Purg
-0.56
POSITIVE LOGITS
aps
0.94
FTWARE
0.94
fter
0.91
yip
0.91
forth
0.90
letes
0.89
oner
0.85
bered
0.81
romeda
0.80
ooo
0.80
Activations Density 0.080%