INDEX
Explanations
names consisting of two words, with the second word being 'Wang'
mentions of the name "Wang."
New Auto-Interp
Negative Logits
phia
-0.76
TAIN
-0.74
ctic
-0.67
tarians
-0.66
cel
-0.65
charge
-0.62
phis
-0.62
judicial
-0.62
sucker
-0.61
unch
-0.60
POSITIVE LOGITS
wana
1.06
enegger
1.02
Chao
0.90
atu
0.86
aii
0.82
arro
0.80
orst
0.80
atche
0.78
ako
0.78
ata
0.77
Activations Density 0.018%