INDEX
Explanations
phrases related to penalty or consequence
references to increasing quantities or allowances
New Auto-Interp
Negative Logits
Maker
-0.75
Fidel
-0.65
Abyss
-0.63
Mate
-0.62
Kubrick
-0.62
mash
-0.62
Marcos
-0.61
[/
-0.61
Mori
-0.61
Ferdinand
-0.59
POSITIVE LOGITS
graded
1.02
grades
0.98
grading
0.85
dates
0.83
icum
0.81
adesh
0.80
rights
0.79
erd
0.78
essions
0.77
olicy
0.76
Activations Density 0.057%