INDEX
Explanations
mentions of countries or nationalities
mentions of countries, companies, and organizations within a context
Negative Logits
afort     -0.70
////      -0.63
Emails    -0.60
ispers    -0.59
Sear      -0.59
immers    -0.58
Scroll    -0.57
âĶģ       -0.57
inse      -0.57
Guan      -0.57
Positive Logits
mates           1.44
mate            1.27
mates           1.07
mate            0.87
men             0.79
leader          0.74
colleague       0.74
counterparts    0.74
ÃŃs             0.69
atana           0.66
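Two tokens in the logit lists, âĶģ and ÃŃs, look like raw byte-level BPE vocabulary strings rather than readable text. The sketch below assumes the underlying model uses a GPT-2-style byte-to-unicode table, which this page does not state, and shows how such strings could be mapped back to the characters they stand for. The helper names `bytes_to_unicode` and `decode_token` are illustrative and not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build the GPT-2-style map from byte values to printable characters.

    Assumption: the model behind this feature uses this byte-level BPE scheme.
    """
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: vocabulary character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a byte-level vocabulary string back into readable text."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # Tokens can be partial UTF-8 sequences, so replace undecodable bytes.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ÃŃs", "âĶģ", "mates"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, ÃŃs decodes to "ís" and âĶģ decodes to the box-drawing character "━", while plain ASCII tokens such as mates are unchanged.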
Activations Density 0.155%