INDEX
Explanations
any words that include the sequence "iti" followed by a high activation value of 7 or more
terms related to a specific organization or place called "Citi"
New Auto-Interp
Negative Logits
dress
-0.82
layer
-0.76
stroke
-0.74
worthy
-0.74
stairs
-0.73
meyer
-0.70
lessness
-0.64
fully
-0.63
sworth
-0.63
scale
-0.62
POSITIVE LOGITS
Äĩ
1.18
atives
1.05
iti
0.99
ña
0.96
Bike
0.88
emi
0.84
zed
0.83
ñ
0.83
uting
0.83
ative
0.83
Activations Density 0.015%