INDEX
Explanations
pronouns referring to "you"
New Auto-Interp
Negative Logits
avait
0.38
Coast
0.38
Browns
0.37
Post
0.37
Clancy
0.36
singularity
0.35
Cedex
0.35
Sav
0.35
ود
0.35
NY
0.35
POSITIVE LOGITS
ם
0.55
אתה
0.50
usted
0.49
гиз
0.47
حضرتك
0.44
ıza
0.43
ﻢ
0.42
Anda
0.41
Anda
0.41
ೀರಿ
0.41
Activations Density 0.003%