INDEX
Explanations
numerical values, particularly those related to percentages and ratings
New Auto-Interp
Negative Logits
ophon
-0.79
ĸļ
-0.77
paio
-0.76
sein
-0.71
phis
-0.68
yle
-0.68
chio
-0.65
icago
-0.64
ichick
-0.64
ei
-0.63
POSITIVE LOGITS
th
0.97
isher
0.94
ishers
0.92
ishing
0.92
%-
0.91
00
0.91
60
0.89
%
0.89
50
0.88
%:
0.88
Activations Density 0.044%