Explanations
phrases indicating depth or analysis
references to depth or profoundness
Negative Logits
Token        Logit
Prosecut     -0.66
bill         -0.64
obe          -0.64
kill         -0.62
Provided     -0.62
Elect        -0.61
von          -0.59
Volunteers   -0.58
cert         -0.58
cop          -0.58
Positive Logits
Token        Logit
deeper       3.51
deep         2.04
deepest      1.95
closer       1.71
deepen       1.68
richer       1.66
deepening    1.66
wider        1.65
darker       1.57
eeper        1.55
Activations Density: 0.010%