INDEX
Explanations
expressions related to knowledge, understanding, and inquiry
New Auto-Interp
Head Attr Weights
0:0.07
1:0.04
2:0.04
3:0.04
4:0.03
5:0.16
6:0.02
7:0.04
8:0.36
9:0.04
10:0.07
11:0.05
Negative Logits
Surv
-1.87
Hai
-1.85
Cance
-1.83
Flan
-1.66
Strike
-1.66
Bec
-1.65
Beau
-1.64
Bye
-1.58
Exec
-1.56
Inv
-1.56
POSITIVE LOGITS
thumbnails
1.61
audi
1.60
amiya
1.59
strate
1.57
medium
1.56
Jet
1.54
soDeliveryDate
1.53
Moscow
1.51
arest
1.50
MRI
1.49
Activations Density 0.047%