INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.06
1:0.08
2:0.08
3:0.07
4:0.07
5:0.09
6:0.09
7:0.08
8:0.08
9:0.08
10:0.07
11:0.09
Negative Logits
ebted
-1.41
Riy
-1.41
Winc
-1.36
enza
-1.34
Nep
-1.34
phia
-1.28
leep
-1.26
hematic
-1.25
owes
-1.24
igi
-1.24
POSITIVE LOGITS
EStream
1.54
ullivan
1.52
Topic
1.40
Carson
1.39
](
1.36
uer
1.34
soType
1.30
discouraging
1.30
AZ
1.28
��
1.27
Activations Density 0.000%
No Known Activations
This feature has no known activations.