INDEX
Explanations
scientific observations and findings that denote noteworthy or novel results
New Auto-Interp
Negative Logits
or
-0.50
↵↵
-0.50
:
-0.48
;
-0.47
.
-0.45
<bos>
-0.44
-
-0.44
UnknownFields
-0.43
/
-0.43
<eos>
-0.43
POSITIVE LOGITS
createSlice
1.01
ComVisible
0.84
vaders
0.80
autorytatywna
0.80
Ganzen
0.79
complexContent
0.79
stuffs
0.78
=$?
0.78
CWE
0.76
تضيفلها
0.76
Activations Density 0.571%