INDEX
Explanations
phrases related to health implications and significant societal influences
New Auto-Interp
Negative Logits
asiatique
-0.66
supérieurs
-0.63
autoradio
-0.62
japonaise
-0.60
connues
-0.60
fotografico
-0.60
connus
-0.59
picioare
-0.59
solito
-0.58
botanist
-0.58
POSITIVE LOGITS
']}
0.84
%"),
0.83
'));
0.81
'),
0.79
'))
0.79
')):
0.78
'},
0.78
")));
0.78
}")
0.77
"])
0.76
Activations Density 1.174%