INDEX
Explanations
the word "Roeder", as it appears multiple times with high activations
words related to confederation or entities with "Feder" in them
New Auto-Interp
Negative Logits
smartphones
-0.65
clarity
-0.63
summons
-0.63
Floor
-0.62
overtime
-0.61
satisfaction
-0.59
slowdown
-0.59
haze
-0.59
360
-0.57
gears
-0.57
POSITIVE LOGITS
eder
4.52
ederation
2.02
eding
1.56
eds
1.34
ede
1.30
ederal
1.23
Feder
1.22
Confeder
1.15
edes
1.09
rer
1.04
Activations Density 0.006%