INDEX
Explanations
dollar amounts in the range from thousands to hundreds of thousands
New Auto-Interp
Negative Logits
behav
-0.78
advoc
-0.76
FW
-0.69
proble
-0.65
ingred
-0.65
advis
-0.64
*/(
-0.62
philos
-0.61
anch
-0.60
typ
-0.58
POSITIVE LOGITS
000
1.96
500
1.58
700
1.40
400
1.39
600
1.39
200
1.39
800
1.37
050
1.35
900
1.34
300
1.34
Activations Density 0.042%