INDEX
Explanations
publication information and details about books and their editions
Negative Logits
Token        Logit
akes         -0.15
kus          -0.14
akens        -0.14
olini        -0.14
rel          -0.14
_PTR         -0.14
eron         -0.13
ished        -0.13
relational   -0.13
Kaiser       -0.13
Positive Logits
Token        Logit
197          0.21
Morrow       0.17
Vintage      0.17
198          0.17
196          0.17
Doub         0.17
Random       0.16
Benchmark    0.16
Random       0.16
éº           0.16
Activations Density: 0.104%