INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.06
2:0.08
3:0.08
4:0.08
5:0.06
6:0.09
7:0.08
8:0.07
9:0.08
10:0.08
11:0.08
Negative Logits
fertil
-1.58
fal
-1.56
ammon
-1.55
licence
-1.51
practise
-1.44
ndum
-1.43
stagn
-1.42
arf
-1.41
uka
-1.40
stake
-1.39
POSITIVE LOGITS
edIn
1.98
CLASS
1.78
oples
1.72
ovych
1.70
leck
1.65
Rouge
1.62
arthed
1.59
HUD
1.58
-+-+
1.55
��
1.53
Activations Density 0.000%
No Known Activations
This feature has no known activations.