Explanations
references to the city of Atlanta and its historical context
Negative Logits
Token      Logit
eds        -0.17
edly       -0.17
oppable    -0.16
eren       -0.16
uther      -0.16
iders      -0.15
lessly     -0.15
alez       -0.14
eden       -0.14
edor       -0.14
Positive Logits
Token      Logit
Georgia    0.22
Atlanta    0.21
Georgian   0.19
Georgia    0.18
inker      0.17
sou        0.17
Georg      0.16
resi       0.16
Hawks      0.16
Peach      0.16
Activations Density: 0.023%