INDEX
Explanations
mentions of significant cultural or historical references, particularly in literature or societal issues
Negative Logits
'}>           -0.72
):}           -0.66
']")          -0.63
Himo          -0.62
"}>           -0.62
%">           -0.59
Numerade      -0.59
izielle       -0.58
);?>          -0.57
Paglinawan    -0.57
Positive Logits
Volume        1.04
Volume        0.96
volume        0.92
volume        0.84
VOLUME        0.80
VOLUME        0.79
Volumen       0.78
Vol           0.73
vol           0.70
vol           0.67
Activations Density 0.311%