A default vBulletin 4.X did not actually use UTF-8 in any way shape or form. If the character was not within the standard ASCII character set, it was converted and stored as its HTML Entity. It was not stored in UTF-8 or processed as UTF-8 at any time.
The MySQL Connection has nothing to do with how data is stored or processed within vBulletin. It tells MySQL how to accept data. Even with UTF-8, we don't actually know what character is what? Why? Because UTF-8 stores characters in 2-4 bytes on MySQL. This is why MySQL actually recommends UTF8MB4 and that is what vBulletin 5 has been developed to use these days. With UTF8MB4, every character takes 4 bytes.
vBulletin will create tables with the default collation of your database. If your database's default collation is latin1_swedish_ci (like 99% of installations before MySQL 8) than that is what vBulletin will use. We assume you know more about your database than we do. You would have multiple collations if you changed the collation on some tables and did not update the default collation. The only exception to this rule is on New Installations of vBulletin 5.x will force the UTF8MB4 character set and corresponding collation. On MySQL 5.7, this should be something like utf8mb4_general_ci. On MySQL 8.0 and higher it should be utf8mb4_900_ai_ci.
Language settings for vBulletin are accessed in the AdminCP under Languages → Language Manager. The only HTML Character Set that should be used in 2022 is UTF-8. The locale should match the actual language and be the UTF-8 variant. If you do not specify the locale the text used in dates and times will not match that language.
It is always complicated converting older databases. They often have a lot of cruft and there is no way to guarantee they haven't been manipulated by external software like plugins.
The MySQL Connection has nothing to do with how data is stored or processed within vBulletin. It tells MySQL how to accept data. Even with UTF-8, we don't actually know what character is what? Why? Because UTF-8 stores characters in 2-4 bytes on MySQL. This is why MySQL actually recommends UTF8MB4 and that is what vBulletin 5 has been developed to use these days. With UTF8MB4, every character takes 4 bytes.
vBulletin will create tables with the default collation of your database. If your database's default collation is latin1_swedish_ci (like 99% of installations before MySQL 8) than that is what vBulletin will use. We assume you know more about your database than we do. You would have multiple collations if you changed the collation on some tables and did not update the default collation. The only exception to this rule is on New Installations of vBulletin 5.x will force the UTF8MB4 character set and corresponding collation. On MySQL 5.7, this should be something like utf8mb4_general_ci. On MySQL 8.0 and higher it should be utf8mb4_900_ai_ci.
Language settings for vBulletin are accessed in the AdminCP under Languages → Language Manager. The only HTML Character Set that should be used in 2022 is UTF-8. The locale should match the actual language and be the UTF-8 variant. If you do not specify the locale the text used in dates and times will not match that language.
It is always complicated converting older databases. They often have a lot of cruft and there is no way to guarantee they haven't been manipulated by external software like plugins.
Comment