INDEX
Explanations
ranges or numerical values expressed as ratios
phrases indicating approximate numerical ranges
New Auto-Interp
Negative Logits
ModLoader
-0.96
åŃIJ
-0.72
ACTED
-0.71
ById
-0.69
jay
-0.68
ãĤ´ãĥ³
-0.66
çķ
-0.65
eries
-0.63
NetMessage
-0.63
ÑĮ
-0.62
POSITIVE LOGITS
60
1.35
80
1.31
120
1.30
70
1.29
90
1.24
150
1.24
100
1.23
40
1.18
50
1.17
75
1.17
Activations Density 0.050%