INDEX
Explanations
words related to specific locations or landmarks, particularly "Tahrir Square" in Cairo, Egypt
references to the location "Tahrir Square."
New Auto-Interp
Negative Logits
states
-0.74
rules
-0.73
Referred
-0.71
Proced
-0.70
Prospect
-0.70
Nationwide
-0.67
Choice
-0.67
Practice
-0.67
Goal
-0.66
Investors
-0.65
POSITIVE LOGITS
rir
1.56
unda
1.03
unia
0.98
ée
0.92
ascus
0.92
xtap
0.90
andom
0.89
seless
0.88
rall
0.86
hod
0.86
Activations Density 0.008%