INDEX
Explanations
phrases related to percentages or numerical figures
references to percentages, particularly in the context of probabilities or statistics
New Auto-Interp
Negative Logits
actor
-0.81
zag
-0.71
uden
-0.68
teen
-0.63
hood
-0.63
agents
-0.61
agitation
-0.61
incons
-0.60
undown
-0.60
faced
-0.60
POSITIVE LOGITS
80
0.98
211
0.83
dayName
0.82
70
0.81
%"
0.80
eenth
0.79
olver
0.77
60
0.77
%
0.76
percent
0.75
Activations Density 0.071%