INDEX
Explanations
phrases related to controversial topics or debates
occurrences of notation or symbols related to lists or references within a text
Negative Logits
cradle    -0.74
awaru     -0.70
aeus      -0.68
dy        -0.68
ts        -0.67
anx       -0.67
icals     -0.65
tin       -0.65
roe       -0.64
otin      -0.64
Positive Logits
CLASSIFIED            1.09
...]                  1.04
…]                    0.99
[...]                 0.92
Appears               0.90
externalActionCode    0.90
},{"                  0.89
³³³³³³³³              0.89
……………………              0.88
[…]                   0.86
Activations Density 0.004%