INDEX
Explanations
numerical values and dates related to events
New Auto-Interp
Negative Logits
ify
-0.19
almost
-0.18
nearly
-0.18
ekt
-0.17
hausen
-0.15
oksen
-0.15
ungle
-0.15
Nearly
-0.14
Almost
-0.14
agan
-0.14
POSITIVE LOGITS
dozen
0.22
nine
0.20
six
0.18
eight
0.17
eight
0.17
six
0.17
ä¸ī个
0.16
12
0.16
seven
0.16
dozens
0.15
Activations Density 0.157%