INDEX
Explanations
words relating to the city of Kolkata
references to the city of Kolkata and related locations
New Auto-Interp
Negative Logits
ModLoader
-0.72
eur
-0.68
9999
-0.67
Schr
-0.67
tall
-0.65
resses
-0.63
wid
-0.62
BILITIES
-0.62
handsome
-0.61
ãĥ¼ãĥĨ
-0.60
POSITIVE LOGITS
ovy
1.03
estone
1.00
lore
0.95
owsky
0.93
atan
0.93
ata
0.92
owa
0.90
unin
0.89
odon
0.87
olk
0.87
Activations Density 0.015%