INDEX
Explanations
phrases expressing curiosity or surprise regarding situations
explaining why or giving reasons
New Auto-Interp
Negative Logits
שוליים
-0.70
gyhoeddwyd
-0.49
Wiktionnaire
-0.47
ècie
-0.47
İstinadlar
-0.47
دانشنامهٔ
-0.46
Sucesor
-0.44
testSet
-0.44
-0.44
PostMapping
-0.42
POSITIVE LOGITS
reason
0.51
难怪
0.50
why
0.48
ragione
0.46
razão
0.46
razón
0.45
latego
0.44
therefore
0.44
sprechend
0.43
reasons
0.43
Activations Density 0.124%