INDEX
Explanations
numerical values related to measurements or times
New Auto-Interp
Negative Logits
NEGLIGENCE
-0.14
ãĥĨãĥ«
-0.14
gom
-0.13
tÃŃm
-0.13
dat
-0.13
ToFit
-0.13
537
-0.13
zimmer
-0.13
rimon
-0.13
sack
-0.13
POSITIVE LOGITS
IDO
0.16
ervas
0.15
finity
0.15
using
0.15
zes
0.15
ppo
0.15
finally
0.14
unden
0.14
errupt
0.14
flush
0.14
Activations Density 0.022%