INDEX
Explanations
phrases related to benefits, consequences, and implications of decisions or actions
Negative Logits
ーニ    -0.19
afia    -0.17
onia    -0.16
anske   -0.15
ngo     -0.15
rych    -0.14
뷰      -0.13
quia    -0.13
صور     -0.13
ufen    -0.13
Positive Logits
SKTOP   0.16
mey     0.15
emit    0.15
nam     0.14
feas    0.14
μερ     0.13
rud     0.13
/ros    0.13
éri     0.13
atom    0.13
Activations Density 0.329%