INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.07
3:0.09
4:0.09
5:0.08
6:0.07
7:0.08
8:0.08
9:0.08
10:0.07
11:0.07
Negative Logits
pony
-2.75
────
-2.74
snail
-2.61
tube
-2.60
barn
-2.54
.」
-2.52
worms
-2.52
reluctantly
-2.40
wedd
-2.39
patiently
-2.37
POSITIVE LOGITS
Interior
2.71
Cities
2.67
ivic
2.63
Kaepernick
2.54
Park
2.52
Basketball
2.42
Schools
2.37
Edu
2.35
Mental
2.33
Geh
2.31
Activations Density 0.000%
No Known Activations
This feature has no known activations.