INDEX
Explanations
phrases expressing strong recommendations or endorsements
preceding or related to recommendations
making strong recommendations
New Auto-Interp
Negative Logits
RenderAtEndOf
-0.65
caufe
-0.63
Autoritní
-0.63
pleaſure
-0.63
fubject
-0.63
Majefty
-0.62
שוליים
-0.60
cauſe
-0.60
ſmall
-0.59
elry
-0.57
POSITIVE LOGITS
recommend
1.34
recommends
1.05
recommended
1.01
highly
1.00
recommend
0.98
Recommend
0.98
strongly
0.94
Recommend
0.94
RECOMMEND
0.92
recommending
0.91
Activations Density 0.078%