INDEX
Explanations
phrases related to legal matters, specifically involving threats and reporting incidents to authorities
commas and phrases related to explanations or clarifications
Negative Logits
staggered   -0.64
lun         -0.61
oe          -0.59
efe         -0.58
isy         -0.57
output      -0.57
deliber     -0.57
istrate     -0.56
Rober       -0.56
buttocks    -0.56
Positive Logits
prototype   0.81
SourceFile  0.80
Flavoring   0.76
esm         0.75
Devices     0.74
specific    0.74
ãĤ¼         0.73
Origin      0.72
ITS         0.72
Inc         0.71
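
The token `ãĤ¼` in the positive logits looks garbled, but it is consistent with how a GPT-2-style byte-level BPE vocabulary displays raw bytes. The sketch below assumes the feature's model uses that tokenizer family, which the page itself does not state. It rebuilds the byte-to-character table and inverts it to recover the underlying text. The helper names `bytes_to_unicode` and `decode_token` are illustrative, not part of the dashboard.

```python
def bytes_to_unicode():
    """Byte -> printable character table used by GPT-2-style byte-level BPE."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to an unused code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Inverse table: vocabulary character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a vocabulary string back into text (assumes byte-level BPE)."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    # A token may hold only part of a multi-byte character, so do not fail hard.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĤ¼", "Origin", "Inc"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, `ãĤ¼` corresponds to the bytes `E3 82 BC`, which is the UTF-8 encoding of the katakana character `ゼ`. Plain ASCII tokens such as `Origin` decode to themselves. A character outside the 256-entry table raises `KeyError`, which would indicate that the vocabulary is not byte-level BPE.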
Activations Density 0.490%