INDEX
Explanations
Roman numerals, specifically 'IV' with a slightly higher activation for 'IV' at 10 compared to 9
references to a specific term or abbreviation, particularly related to 'IV' and associated phrases
New Auto-Interp
Negative Logits
goose
-0.78
Glen
-0.72
pine
-0.70
head
-0.63
Hayward
-0.63
targets
-0.62
target
-0.61
head
-0.61
Chancellor
-0.61
coffee
-0.61
POSITIVE LOGITS
IV
4.25
iv
2.22
IVES
2.08
IVE
1.95
III
1.94
VII
1.72
VI
1.69
IVER
1.67
IV
1.66
II
1.63
Activations Density 0.005%