INDEX
Explanations
words that contain the specific sequence "rom" with a high activation value
instances of the word "rom" in various contexts
New Auto-Interp
Negative Logits
Notes
-0.72
Ĵ
-0.70
ŃĶ
-0.67
Citation
-0.67
Reserve
-0.67
Jackets
-0.64
attribution
-0.63
fav
-0.62
need
-0.61
Venezuela
-0.61
POSITIVE LOGITS
rom
1.24
antic
1.12
antically
1.04
astery
1.01
thal
1.00
ancers
1.00
onse
0.95
agnetic
0.95
antics
0.93
orph
0.92
Activations Density 0.008%