INDEX
Explanations
phrases indicating varying degrees of comparison or importance
phrases that emphasize intensity or degree
New Auto-Interp
Negative Logits
ãĤ¼ãĤ¦ãĤ¹
-0.61
Halls
-0.60
enhagen
-0.60
ãĥį
-0.59
KM
-0.58
acre
-0.58
MORE
-0.56
REC
-0.55
SHARES
-0.54
ãĥ«
-0.54
POSITIVE LOGITS
apy
1.01
othes
0.97
oths
0.97
bered
0.94
othe
0.87
zin
0.79
aker
0.76
iled
0.76
aps
0.76
oner
0.74
Activations Density 0.045%