INDEX
Explanations
numerical comparisons and thresholds related to percentages
New Auto-Interp
Negative Logits
hal
-0.37
surla
-0.37
RSpec
-0.37
mix
-0.35
msgSender
-0.34
взять
-0.34
Cav
-0.34
ViewFeatures
-0.33
festi
-0.32
mis
-0.31
POSITIVE LOGITS
THAN
0.73
than
0.71
Than
0.65
ویکیپدی
0.63
Than
0.58
than
0.58
noDo
0.56
YOND
0.56
THAN
0.54
帖最后由
0.54
Activations Density 0.662%