INDEX
Explanations
references to a specific location called "Tahrir Square"
mentions of specific locations, particularly "Tahrir Square" and "Tulip."
New Auto-Interp
Negative Logits
mble
-0.86
wagen
-0.72
ISTER
-0.69
heid
-0.67
fixation
-0.67
ancial
-0.64
ablishment
-0.64
quirks
-0.63
cov
-0.63
bilt
-0.61
POSITIVE LOGITS
rir
1.04
oya
0.99
anus
0.93
ango
0.93
essa
0.92
Tah
0.91
iti
0.85
ania
0.82
ti
0.82
ibia
0.82
Activations Density 0.022%