INDEX
Explanations
dates and numerical references
New Auto-Interp
Negative Logits
éŀ
-0.17
okit
-0.16
Hubb
-0.15
appa
-0.15
hub
-0.14
ollo
-0.14
iÄĩ
-0.14
azo
-0.14
orta
-0.14
umann
-0.14
POSITIVE LOGITS
feed
0.16
-feed
0.15
/DTD
0.14
Feed
0.14
stÃŃ
0.14
feed
0.14
yonel
0.14
-haspopup
0.13
Dün
0.13
ended
0.13
Activations Density 0.012%