INDEX
Explanations
references to bridges in various contexts
New Auto-Interp
Negative Logits
Svensk
-0.79
pitié
-0.78
Romains
-0.73
aureus
-0.73
sauvages
-0.72
Ino
-0.71
sauvage
-0.69
imageNamed
-0.68
femmin
-0.67
Moly
-0.67
POSITIVE LOGITS
bridges
1.71
Bridges
1.66
Bridges
1.64
bridge
1.56
Bridge
1.52
Bridge
1.44
BRIDGE
1.40
bridges
1.40
BRID
1.39
bridge
1.36
Activations Density 0.079%