INDEX
Explanations
punctuation marks and phrases used to indicate lists or series of items
Head Attr Weights
0:0.02
1:0.02
2:0.13
3:0.12
4:0.15
5:0.04
6:0.03
7:0.06
8:0.06
9:0.07
10:0.10
11:0.13
Negative Logits
hyde:-1.49
veland:-1.43
nai:-1.41
arre:-1.33
vind:-1.33
ibal:-1.30
emale:-1.29
andre:-1.28
ajor:-1.27
aila:-1.27
Positive Logits
then:1.42
Trails:1.23
Tomorrow:1.22
check:1.21
Origin:1.21
weeds:1.19
Profile:1.18
Puzzle:1.16
Topic:1.16
Next:1.14
Activations Density: 0.035%