Explanations
numeric values related to various performance metrics or parameters
Negative Logits
Diſ       -0.84
Majefty   -0.83
Theſe     -0.82
myſelf    -0.82
Jefus     -0.81
theſe     -0.79
becauſe   -0.77
itſelf    -0.77
Conſ      -0.77
Reſ       -0.77
Positive Logits
IsContent          0.57
iffa               0.54
AnimationsModule   0.49
חיצוניים           0.49
ge                 0.46
Nev                0.44
cellpadding        0.44
tre                0.43
bu                 0.42
part               0.42
Activations Density: 0.030%