INDEX
Explanations
words related to legal issues, specifically fraud
references to fraud
New Auto-Interp
Negative Logits
%"
-0.69
ŃĶ
-0.67
âĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢ
-0.65
UTC
-0.62
Dak
-0.62
baugh
-0.62
Arms
-0.60
bian
-0.59
Candle
-0.59
Views
-0.58
POSITIVE LOGITS
ulence
1.37
ulent
1.31
fraud
1.02
sters
1.02
ster
0.99
scam
0.86
ously
0.85
raud
0.84
perpetrated
0.84
inals
0.78
Activations Density 0.010%