INDEX
Explanations
references to famous landmarks, specifically the Eiffel Tower
New Auto-Interp
Negative Logits
anu
-1.09
aml
-1.01
udeb
-0.97
mble
-0.95
vati
-0.95
hran
-0.93
pport
-0.93
asu
-0.92
icz
-0.92
emo
-0.92
POSITIVE LOGITS
flush
0.70
Genius
0.70
Cruise
0.69
Bullets
0.67
bailout
0.66
Islands
0.65
Playoff
0.63
Refuge
0.61
Valley
0.61
heartbeat
0.61
Activations Density 0.248%