INDEX
Explanations
HTML and table formatting tags
Negative Logits
Token       Logit
Rohy        -0.59
Spencer     -0.56
ses         -0.55
Kenney      -0.55
Whitney     -0.55
anka        -0.55
URE         -0.54
Burke       -0.54
hende       -0.54
Gough       -0.53
Positive Logits
Token       Logit
"],         1.10
}")         1.03
itſelf      1.01
Chwiliwch   0.97
*/;         0.96
!")         0.95
"]          0.94
`),         0.92
"])         0.92
]");        0.91
Activations Density: 0.083%