INDEX
Explanations
mentions of names, especially the name "Yas" at various activations
mentions of specific names, particularly "Yas" and "Amit"
New Auto-Interp
Negative Logits
arity
-0.86
ichick
-0.75
Transfer
-0.72
Africans
-0.69
ancial
-0.69
Lomb
-0.68
roads
-0.67
African
-0.67
gomery
-0.66
source
-0.65
POSITIVE LOGITS
Yas
1.20
seiz
0.98
mosqu
0.95
ushi
0.88
resil
0.87
redes
0.87
suspic
0.83
simultane
0.80
uki
0.80
©¶æ
0.75
Activations Density 0.012%