INDEX
Explanations
phrases that invite interaction or engagement with the audience
Negative Logits
u          -0.14
igne       -0.14
jah        -0.14
inka       -0.14
fortune    -0.13
ott        -0.13
ált        -0.13
oru        -0.13
alia       -0.13
anza       -0.13
Positive Logits
visit      0.21
visit      0.21
IMA        0.17
Lon        0.17
visita     0.16
visits     0.16
_visit     0.16
iad        0.16
659        0.15
visite     0.15
Activations Density 0.049%
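
The density figure above is assumed here to mean the share of sampled tokens on which the feature fires at all; the page itself does not define it. Under that assumption, a minimal sketch of the calculation might look like the following. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with a nonzero
    (above-threshold) activation. The source page does not define it.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, 5 of which activate the feature.
sample = [0.0] * 9995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(sample)

# Format as a percentage with three decimals, in the style the page uses.
print(f"{density:.3%}")
```

A density this low would indicate a sparse feature that fires on roughly one token in two thousand, which fits an explanation tied to a narrow family of tokens.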