INDEX
Explanations
expressions of humor and irony
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.05
3:0.08
4:0.12
5:0.02
6:0.04
7:0.40
8:0.03
9:0.03
10:0.05
11:0.08
Negative Logits
ailability    -2.33
tradem        -2.04
phabet        -1.97
apons         -1.91
psey          -1.84
querque       -1.74
ngth          -1.72
ompl          -1.69
chnology      -1.68
ascus         -1.67
Positive Logits
Lange         1.61
[&            1.59
jokes         1.57
Planet        1.50
Watkins       1.46
Doodle        1.38
McA           1.37
Unch          1.36
Simple        1.33
Constable     1.33
Activations Density 0.006%