INDEX
Explanations
references to blogs
occurrences of the word "blog."
New Auto-Interp
Negative Logits
Lauder
-0.75
BuyableInstoreAndOnline
-0.72
evils
-0.64
inho
-0.62
Baal
-0.61
odka
-0.60
Scotia
-0.60
chloride
-0.60
overpowered
-0.59
Yin
-0.59
POSITIVE LOGITS
osphere
1.33
gers
1.27
ging
1.13
ged
1.13
ger
1.09
post
1.06
spot
1.01
gments
0.94
posts
0.93
posts
0.92
Activations Density 0.035%