INDEX
Explanations
mentions of a specific name "Lane"
mentions of the name "Lane" with varying significance
New Auto-Interp
Negative Logits
ERAL
-0.76
ulatory
-0.75
ority
-0.71
anamo
-0.71
rador
-0.68
irements
-0.67
ional
-0.66
Seym
-0.66
translation
-0.66
raints
-0.65
POSITIVE LOGITS
Lane
1.20
lane
1.14
leigh
0.75
leys
0.74
hattan
0.71
illard
0.69
bridge
0.68
sey
0.68
bys
0.68
©¶æ
0.68
Activations Density 0.005%