INDEX
Explanations
locations and landmarks in Paris
Negative Logits
Token       Logit
French      -0.16
eneg        -0.15
French      -0.15
ovah        -0.15
olid        -0.14
cest        -0.14
cean        -0.14
french      -0.14
bart        -0.14
Compound    -0.14
Positive Logits
Token       Logit
Place       0.28
Invalid     0.25
Pig         0.24
Sac         0.23
Tu          0.21
Tro         0.21
Place       0.20
Luxembourg  0.20
Quart       0.19
Left        0.18
Activations Density: 0.092%