INDEX
Explanations
phrases or sentences with intensifiers and actions escalating towards a critical point
phrases indicating extreme situations or conditions
Negative Logits
token      logit
akia       -0.58
rounder    -0.58
odge       -0.57
DRAG       -0.57
Together   -0.56
Others     -0.56
missions   -0.55
asts       -0.55
Variant    -0.54
olly       -0.52
POSITIVE LOGITS
token      logit
where      1.07
whereby    0.83
whence     0.80
point      0.79
lessness   0.78
that       0.76
manship    0.74
ophys      0.74
brink      0.72
wherein    0.71
Activations Density 0.041%
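
As a rough reading of the density figure, the sketch below converts the percentage into an expected count of activating tokens. It assumes the figure is the share of sampled tokens on which this feature has a nonzero activation, which the page itself does not define. The function name and the one-million-token sample size are hypothetical, chosen only for illustration.

```python
def expected_active_tokens(density_percent: float, n_tokens: int) -> float:
    """Expected number of tokens with nonzero activation.

    Assumption: density_percent is the percentage of sampled tokens
    on which the feature fires (not defined on the source page).
    """
    return density_percent / 100.0 * n_tokens


if __name__ == "__main__":
    density_percent = 0.041   # value reported on the page
    sample_size = 1_000_000   # hypothetical sample size
    count = expected_active_tokens(density_percent, sample_size)
    print(f"about {count:.0f} activating tokens per {sample_size:,} tokens")
```

Under that assumption the feature is sparse: it fires on roughly four tokens in every ten thousand.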