INDEX
Explanations
special characters and symbols
patterns or symbols that indicate formatting or structure in text
New Auto-Interp
Negative Logits
inav
-0.72
lling
-0.71
ollar
-0.69
irst
-0.66
spir
-0.65
clipping
-0.64
deliber
-0.61
lections
-0.60
ovo
-0.60
pora
-0.59
POSITIVE LOGITS
*/
1.02
*****
0.89
=================================================================
0.87
===
0.84
=-=-
0.84
-->
0.83
=-
0.76
---
0.76
edit
0.76
quote
0.76
Activations Density 0.065%