INDEX
Explanations
mentions of Paris and its associated institutions
the word "Paris" in various contexts.
Negative Logits
delwed        -0.48
betweenstory  -0.45
للاسماء       -0.41
Indies        -0.40
zzleHttp      -0.39
owicz         -0.38
dersfield     -0.37
rsiniz        -0.37
Inquisition   -0.37
stående       -0.37
Positive Logits
Paris     1.17
Paris     1.05
paris     0.91
Parisi    0.88
paris     0.88
Пари      0.82
París     0.80
PARIS     0.80
Parisian  0.80
Parigi    0.80
Activations Density 0.084%