INDEX
Explanations
mentions of specific locations, particularly junctions
references to junctions and their associated contexts
Negative Logits
Winner       -0.78
Panic        -0.77
rek          -0.71
yre          -0.70
liquid       -0.68
anyahu       -0.67
ogun         -0.66
isable       -0.65
galitarian   -0.64
istor        -0.63
Positive Logits
jun        1.44
cture      1.29
junction   1.04
ctions     0.89
iors       0.84
eteenth    0.84
iper       0.83
ames       0.74
uke        0.74
iet        0.73
Activations Density 0.008%