INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.09
2:0.07
3:0.07
4:0.06
5:0.08
6:0.09
7:0.09
8:0.07
9:0.08
10:0.08
11:0.07
Negative Logits
Lev          -2.84
NEC          -2.79
Lite         -2.72
Mutant       -2.54
reader       -2.48
Hebrew       -2.39
ripple       -2.34
Prediction   -2.33
Lev          -2.32
viral        -2.29
Positive Logits
|--          3.32
psc          3.10
uties        3.01
rentices     2.93
Frames       2.91
ayne         2.83
frames       2.79
fram         2.74
retty        2.73
unte         2.64
Activations Density 0.000%
This feature has no known activations.