INDEX
Explanations
symbols or characters indicative of formatting or coding artifacts
Head Attr Weights
0:0.10
1:0.03
2:0.08
3:0.03
4:0.04
5:0.03
6:0.26
7:0.04
8:0.09
9:0.19
10:0.03
11:0.02
Negative Logits
Clarke     -4.19
Coil       -4.05
catentry   -4.02
Alice      -3.97
Ö          -3.86
Alabama    -3.84
alin       -3.80
Lawson     -3.74
Palin      -3.66
Machine    -3.65
Positive Logits
Gent       10.89
gent       7.55
gent       5.51
Vincent    4.04
Gast       3.98
nob        3.97
Sag        3.87
Pant       3.86
ghetto     3.80
Torah      3.76
Activations Density 0.000%