INDEX
Explanations
distinctive punctuation marks or dash-like symbols that indicate a change in thought or a new statement
New Auto-Interp
Negative Logits
obbies
-0.70
eur
-0.69
respons
-0.68
manif
-0.68
milo
-0.67
vanity
-0.64
distingu
-0.63
omething
-0.61
reper
-0.60
ividual
-0.60
POSITIVE LOGITS
Expand
0.82
Reporter
0.80
Legislation
0.76
Transcript
0.74
Wrestling
0.73
Recap
0.70
CLUS
0.70
largeDownload
0.69
Warning
0.69
[[
0.68
Activations Density 0.014%