INDEX
Explanations
numerical values separated by commas
numerical values or dates
New Auto-Interp
Negative Logits
olson
-0.75
exha
-0.66
phal
-0.65
suspic
-0.64
tti
-0.64
reluct
-0.62
deen
-0.61
llular
-0.60
hell
-0.60
everal
-0.59
POSITIVE LOGITS
00
0.91
-+
0.77
inen
0.73
raction
0.73
ulse
0.71
escription
0.70
arth
0.68
25
0.68
ort
0.68
01
0.63
Activations Density 0.036%