INDEX
Explanations
interrogative phrases and questions related to understanding or gaining knowledge
New Auto-Interp
Negative Logits
[
-0.59
[
-0.56
жен
-0.55
bari
-0.55
cu
-0.54
!
-0.54
-0.53
There
-0.51
te
-0.51
fournir
-0.50
POSITIVE LOGITS
how
1.36
Kako
1.17
कैसे
1.17
Hvordan
1.17
ways
1.15
itſelf
1.15
Nasıl
1.14
Hvordan
1.13
יצד
1.13
HOW
1.11
Activations Density 0.168%