Explanations
references to locations, universities, and organizations
names of locations and institutions, particularly in relation to France and Berkeley
Negative Logits
Token      Logit
Äĩ         -0.78
sterdam    -0.73
iland      -0.71
aceae      -0.71
abad       -0.69
lio        -0.68
atem       -0.67
omore      -0.65
wake       -0.64
conn       -0.63
Positive Logits
Token      Logit
Cable      0.71
Arcade     0.62
ãĥ¼ãĥĨ     0.61
Levant     0.61
Falcon     0.61
çİĭ        0.59
Ultron     0.59
ij士        0.58
Asgard     0.58
Viper      0.57
Activations Density 0.428%
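
Several token strings in the logit lists (for example Äĩ, ãĥ¼ãĥĨ and çİĭ) look like raw vocabulary entries from a byte-level BPE tokenizer, where each byte of the UTF-8 text is displayed as one printable stand-in character. The source does not say which tokenizer produced them, so the following is only a sketch under the assumption that the GPT-2-style byte-to-character table applies. The helper name decode_token is hypothetical and is not part of the dashboard.

```python
def bytes_to_unicode():
    """Byte -> printable character table used by GPT-2-style byte-level BPE.

    ASSUMPTION: the dashboard's model uses this table; the source does not say.
    """
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            # Bytes without a printable form are shifted to code points 256+.
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map a displayed vocabulary string back to text (hypothetical helper).

    Strings containing characters outside the table (already decoded or
    mixed) are returned unchanged rather than guessed at.
    """
    if any(ch not in CHAR_TO_BYTE for ch in token):
        return token
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    # A token may hold only part of a multi-byte character; such bytes
    # are shown as U+FFFD instead of raising an error.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["Äĩ", "ãĥ¼ãĥĨ", "çİĭ", "ij士", "Cable"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Plain ASCII tokens such as Cable pass through unchanged, and a mixed string such as ij士 is left as-is because part of it falls outside the table. If the underlying model uses a different tokenizer, the decoded forms should not be trusted.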