INDEX
Explanations
instances of the word "kind"
Negative Logits
Token        Logit
Autoritní    -0.63
TestBed      -0.56
occhiali     -0.56
UVWXYZ       -0.56
denuncias    -0.55
SafeMath     -0.53
sichtbar     -0.53
tecnici      -0.53
respirar     -0.53
/******/     -0.53
Positive Logits
Token        Logit
sorta        0.69
hearted      0.67
hearted      0.67
Kind         0.63
KIND         0.60
KIND         0.58
surla        0.55
ERG          0.53
sort         0.52
legen        0.52
Activations Density: 0.038%