INDEX
Explanations
occurrences of the preposition "of" and the article "a"
Head Attr Weights
0:0.02
1:0.02
2:0.06
3:0.05
4:0.07
5:0.04
6:0.16
7:0.36
8:0.03
9:0.04
10:0.05
11:0.05
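The head weights listed above can be ranked with a few lines of Python. This is only a minimal sketch: it assumes each line has the form head index:weight, and the names raw, parse_head_weights, and top_heads are hypothetical helpers introduced for illustration, not part of any dashboard tooling.

```python
# Head attribution weights copied verbatim from the listing above.
# Assumption: each line is "<head index>:<weight>".
raw = """0:0.02
1:0.02
2:0.06
3:0.05
4:0.07
5:0.04
6:0.16
7:0.36
8:0.03
9:0.04
10:0.05
11:0.05"""


def parse_head_weights(text):
    """Parse 'index:weight' lines into a dict mapping head index to weight."""
    weights = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        head, value = line.split(":", 1)
        weights[int(head)] = float(value)
    return weights


def top_heads(weights, k=2):
    """Return the k (head, weight) pairs with the largest weights, largest first."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)[:k]


weights = parse_head_weights(raw)
total = sum(weights.values())
for head, w in top_heads(weights):
    # Share is relative to the sum of the listed (rounded) weights.
    print(f"head {head}: {w:.2f} ({w / total:.0%} of listed weight)")
```

On these numbers the two largest entries are heads 7 and 6. The listed weights are rounded, so their sum need not be exactly 1.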
Negative Logits
soDeliveryDate  -1.69
sqor            -1.67
iterranean      -1.63
osuke           -1.61
osponsors       -1.59
FSA             -1.59
renters         -1.59
76561           -1.59
Cosponsors      -1.55
erenn           -1.53
Positive Logits
brilliance   1.74
stroke       1.63
Ball         1.62
memory       1.60
fingers      1.59
imagination  1.55
magic        1.50
Doodle       1.44
intuition    1.43
bery         1.42
Activation Density: 0.000%