INDEX
Explanations
asking for information or advice
New Auto-Interp
Negative Logits
しっかりと
0.45
ವರಿಗೆ
0.45
deteriorating
0.43
deteriorate
0.42
ძალიან
0.41
ಂಭ
0.41
y
0.40
驊
0.39
cknow
0.39
annoy
0.39
POSITIVE LOGITS
answers
0.66
aiuto
0.61
assistance
0.60
réponses
0.60
seek
0.57
informazioni
0.56
sought
0.55
elusive
0.55
答案
0.54
bantuan
0.52
Activations Density 0.138%