INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.07
1:0.05
2:0.09
3:0.08
4:0.09
5:0.08
6:0.09
7:0.07
8:0.08
9:0.08
10:0.09
11:0.08
Negative Logits
sted      -1.68
andise    -1.54
uminati   -1.44
ificant   -1.43
UNESCO    -1.43
Parish    -1.42
SIGN      -1.42
EMENT     -1.40
barg      -1.40
ords      -1.38
Positive Logits
runner    1.68
chens     1.50
hitting   1.47
endon     1.46
adier     1.45
iologist  1.40
abuser    1.37
Runner    1.36
iquette   1.35
apist     1.35
Activations Density: 0.000%
No Known Activations
This feature has no known activations.