INDEX
Explanations
mentions of specific names or titles
specific names and terms related to people or entities, particularly in legal or academic contexts
New Auto-Interp
Negative Logits
Alger
-0.88
Yemen
-0.81
Yemeni
-0.78
Algeria
-0.76
egal
-0.67
213
-0.66
Ń
-0.65
Slash
-0.65
aeda
-0.64
ALD
-0.64
POSITIVE LOGITS
Pok
1.85
Pom
1.60
Pik
1.54
Pon
1.52
Pere
1.46
Pike
1.43
PN
1.43
PP
1.41
P
1.40
Pine
1.39
Activations Density 0.082%