INDEX
Explanations
personal or company names with a specific structure, possibly related to legal or financial documents
occurrences of numerical values or statistics within a text
New Auto-Interp
Negative Logits
boro
-0.75
homebrew
-0.74
Baxter
-0.68
Canter
-0.66
Hancock
-0.66
Ginny
-0.66
Davis
-0.64
temptation
-0.63
Berry
-0.63
Blake
-0.63
POSITIVE LOGITS
Moreover
1.04
Therefore
1.02
à¨
1.00
à¦
0.94
Pakistan
0.94
³³³³³³³³³³³³³³³³
0.94
à©
0.94
Express
0.92
However
0.90
Furthermore
0.89
Activations Density 0.400%