INDEX
Explanations
the phrase "by the way"
phrases that introduce additional comments or asides
Negative Logits
ĸļ                  -1.01
ufact               -0.83
anmar               -0.80
usters              -0.79
omore               -0.78
uster               -0.77
natureconservancy   -0.75
osponsors           -0.73
arij                -0.71
incinn              -0.71
Positive Logits
point        0.80
ward         0.77
points       0.77
KEY          0.70
Pyth         0.68
Remastered   0.68
sey          0.67
heads        0.66
WARD         0.66
liness       0.64
Activations Density 0.012%