INDEX
Explanations
terms related to levels or intensities of a characteristic or attribute
references to severity or intensity of various situations
New Auto-Interp
Negative Logits
icia
-0.78
vl
-0.76
erity
-0.74
frog
-0.73
aughlin
-0.71
BIL
-0.70
ply
-0.70
ISTORY
-0.69
uba
-0.69
mith
-0.69
POSITIVE LOGITS
underlying
1.00
respective
0.98
relationship
0.95
offending
0.94
argument
0.93
spectrum
0.92
phenomenon
0.89
latter
0.89
equation
0.88
problem
0.87
Activations Density 0.252%