INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.09
1:0.08
2:0.07
3:0.07
4:0.07
5:0.08
6:0.09
7:0.08
8:0.07
9:0.08
10:0.08
11:0.07
Negative Logits
abama     -2.64
usc       -2.60
Dover     -2.54
querade   -2.53
abbage    -2.51
annabin   -2.50
attm      -2.48
ertodd    -2.40
ologne    -2.38
actus     -2.35
Positive Logits
Vive         2.56
.�           2.42
Omn          2.41
swapped      2.38
orange       2.33
assetsadobe  2.33
Copy         2.32
Zucker       2.31
orean        2.30
Raptors      2.28
Activations Density 0.000%
No Known Activations
This feature has no known activations.