Explanations
exactly the word "Bruce" with high activation
references to individuals named Bruce
Negative Logits
Token      Logit
esan       -0.73
FIX        -0.72
âĶĢ        -0.71
IX         -0.67
á          -0.66
isen       -0.66
excise     -0.66
station    -0.65
mates      -0.65
Erit       -0.63
Positive Logits
Token      Logit
Bruce      3.51
Bruce      3.05
Clint      1.45
Batman     1.38
Wayne      1.36
Daryl      1.30
Barbara    1.30
Batman     1.28
Terry      1.22
Barry      1.21
Activations Density: 0.023%