INDEX
Explanations
mentions of "Ball" as a significant entity or reference
references to the term "Ball" with varying activations
New Auto-Interp
Negative Logits
liness
-0.72
profit
-0.66
ths
-0.65
éĹĺ
-0.64
terday
-0.63
æĢ
-0.62
CRIP
-0.61
ktop
-0.61
åº
-0.60
iosyncr
-0.59
POSITIVE LOGITS
oons
1.24
park
1.00
oon
0.96
aign
0.92
arat
0.91
antine
0.88
haus
0.88
istics
0.87
assic
0.83
asts
0.83
Activations Density 0.010%