INDEX
Explanations
¡ followed by greetings or affirmative responses
New Auto-Interp
Negative Logits
birds
0.38
collections
0.36
دارای
0.36
を受
0.35
betrieb
0.35
を有する
0.35
የበ
0.35
cows
0.35
familiarity
0.35
workers
0.34
POSITIVE LOGITS
estoy
0.73
puedo
0.69
posso
0.69
думаю
0.67
можете
0.65
gostaria
0.64
vreau
0.64
Tôi
0.62
mogę
0.62
знаю
0.61
Activations Density 0.017%