Explanations
references to the country "France."
mentions of France
Negative Logits
Token       Logit
iary        -0.92
regor       -0.88
atari       -0.81
iating      -0.80
âĹ¼         -0.78
ividual     -0.78
uilt        -0.77
ramid       -0.77
iated       -0.77
iations     -0.77
Positive Logits
Token        Logit
Alps         0.87
Hollande     0.83
fries        0.82
France       0.80
Riv          0.79
Mé           0.76
Marse        0.75
France       0.72
Franc        0.72
countryside  0.69
Activations Density: 0.016%