INDEX
Explanations
punctuation marks indicating the end of sentences or phrases
Head Attr Weights
0:0.17
1:0.09
2:0.06
3:0.06
4:0.04
5:0.10
6:0.02
7:0.01
8:0.17
9:0.08
10:0.03
11:0.11
Negative Logits
endez       -1.54
luck        -1.54
ADRA        -1.50
ramid       -1.49
479         -1.43
plet        -1.42
munition    -1.35
ason        -1.33
lease       -1.31
EMBER       -1.27
Positive Logits
etc          2.08
etc          1.95
Sasha        1.62
Conclusion   1.47
OVA          1.43
Kyoto        1.38
Ibid         1.36
Finally      1.32
Iv           1.32
vo           1.25
Activations Density 0.018%