INDEX
Explanations
references to historical figures and their works related to intelligence and education
Head Attr Weights
0:0.03
1:0.09
2:0.02
3:0.04
4:0.03
5:0.27
6:0.02
7:0.01
8:0.05
9:0.12
10:0.19
11:0.07
Negative Logits
doors     -1.50
Activate  -1.41
Deploy    -1.40
Ops       -1.36
"}],"     -1.33
||||      -1.33
launcher  -1.33
keeper    -1.30
Roaming   -1.30
escal     -1.29
Positive Logits
manuscripts     1.76
diagrams        1.72
cartoons        1.67
coined          1.65
equations       1.62
invented        1.62
DragonMagazine  1.59
writings        1.59
iewicz          1.58
Journals        1.58
Activations Density 0.134%