INDEX
Explanations
the beginning of a document
New Auto-Interp
Negative Logits
...
-0.69
...
-0.65
+
-0.62
.
-0.61
(
-0.57
....
-0.56
form
-0.55
№
-0.53
↵↵
-0.53
windowFixed
-0.53
POSITIVE LOGITS
متعلقه
1.04
pinulongan
0.90
tagHelperRunner
0.86
0.85
houſe
0.81
faſt
0.80
purpoſe
0.79
ſelves
0.78
pleaſure
0.78
Offisielt
0.77
Activations Density 0.035%