INDEX
Explanations
the presence of quotation marks
quotation marks and their surrounding phrases
New Auto-Interp
Negative Logits
buoy
-0.73
tabloid
-0.73
arri
-0.71
seasoned
-0.71
Prim
-0.71
vigil
-0.70
differentiated
-0.69
disciplinary
-0.69
killers
-0.69
pacing
-0.68
POSITIVE LOGITS
true
1.32
false
1.30
personal
1.23
yes
1.22
shall
1.20
normal
1.19
SELECT
1.17
cold
1.17
need
1.16
single
1.15
Activations Density 0.105%