Explanations
phrases that emphasize the concept of news reporting or significant events
Negative Logits
Token         Logit
awa           -0.77
KK            -0.72
respective    -0.71
wagon         -0.69
PB            -0.67
supervised    -0.67
onne          -0.67
Canaver       -0.67
netflix       -0.65
aceous        -0.64
Positive Logits
Token         Logit
Errors        0.89
Hours         0.74
Herald        0.74
Seasons       0.72
Liberties     0.67
Times         0.66
requency      0.65
Inquiry       0.64
Planet        0.63
Ages          0.63
Activations Density: 0.020%