INDEX
Explanations
instances of the word "you" and its variations
New Auto-Interp
Negative Logits
transfieras
-0.52
probably
-0.48
Probably
-0.48
Probably
-0.46
probably
-0.44
Either
-0.44
pasti
-0.41
either
-0.40
نه
-0.38
Either
-0.38
POSITIVE LOGITS
ever
0.79
compare
0.62
Ever
0.59
EVER
0.57
truly
0.56
دقت
0.55
ever
0.53
jemals
0.53
look
0.53
listened
0.52
Activations Density 0.196%