INDEX
Explanations
phrases indicating agreement to terms and conditions
phrases related to user agreement and consent in terms of privacy and policies
New Auto-Interp
Negative Logits
ilts
-0.61
glim
-0.60
oster
-0.60
amina
-0.58
ngth
-0.57
iculture
-0.57
mania
-0.55
ONES
-0.54
gins
-0.54
ordinary
-0.54
POSITIVE LOGITS
itivity
0.68
{{0.65
thereto
0.64
iliate
0.63
taboola
0.61
emn
0.59
Hilbert
0.59
to
0.58
20439
0.56
ettings
0.56
Activations Density 0.025%