Aside from common names that you mention, native English speakers can be equally unfamiliar with names of people and places. If I hear a name I have never heard before I may double-check the pronunciation, or ask how it is spelled to help me visualise and remember that name.
There really are just so many proper nouns out there that nobody could be familiar with every one, and new names are created all the time. Even familiar names of people can often have multiple variations in spelling or pronunciation. The name Karen for example is fairly common in the UK and can also be spelled Caron, maybe other ways too. I have known people pronounce either of these as 'Kar·in' or 'Kair·on'. Some names used in English are borrowed from other languages and can sometimes be spelled in an Anglicised way but retain their original pronunciation. Names are meant to be individual, and people go out of their way to make even common ones sound a bit different.
Place names can be even more confusing, especially places within the UK and the US which have roots in either old English or native American respectively. Over time many places have changed spelling but retained something of the original pronunciation. Sometimes you really have to live in or near a place to know how it is pronounced! For example, I live near a small town called Wesham which is pronounced "Wes-ham" by people that live there, but "Wesh-um" by practically everybody else. Also, many British English speakers are baffled as to how US English speakers pronounce places such as "Maryland" (apparently Americans say 'Mari-lund', not 'Mary Land' as spelling and entymology would suggest) and so even if we have read a particular name we are not always fully prepared to recognise it when we hear it spoken for the first time.
So my answer to your first question of "do native speakers of English ALWAYS understand names they've heard for the first time in their lives on the first try" has to be no we don't! However, it is probably easier for us to know when we are hearing an unfamiliar name as opposed to an unfamiliar word because the likelihood of us not knowing a word is far less likely than an English learner. We probably also grasp context a little easier and are processing our understanding in real time as opposed to constantly translating in our heads.
As to your second question of "is something I can do to bypass this problem?" - all I can suggest is that this will hopefully become easier for you with time and practice. The more familiar you are with spoken English and the different ways native speakers can pronounce common words, the more likely you are to recognise that a proper noun is being introduced. Friends of mine whose first language is not English have commented to the effect that their learning English at school did not fully prepare them for speaking it with native speakers; however after time here they have learned to understand idioms, colloquialisms, accents etc.
One interesting thing I read about the way children pick up their native language is that they learn a great deal by what some experts call "chunking" - that is they learn "chunks" of speech first, then what the individual words mean later. For example, very young children may learn to say "good morning" as a phrase, not yet appreciating that it is two words "good" and "morning". It makes sense that this is the brain's natural way of learning language. But learning a second language is quite the opposite technique. You learn the words, the rules, and it is only when you start speaking it with natives that you learn how these are commonly strung together. I imagine that as you become more familiar with "chunks" of speech you will also more easily recognise the cue for a name to be inserted into the sentence.
𝑇𝐿;𝐷𝑅
In General American English, don't is pronounced /doʊnt/ while in Southern Standard British English, it's pronounced /dəʊnt/.
In don't you, the /t/ of don't and /j/ of you coalesce to /tʃ/. The process is called assimilation.
𝐸𝑥𝑝𝑙𝑎𝑛𝑎𝑡𝑖𝑜𝑛
Assimilation makes nearby sounds more similar to each other. The kind of assimilation in don't you is called coalescent assimilation.
When /j/ comes right after /t/, there's a tendency to assimilate them to /t͡ʃ/. The /t/ is normally articulated at the ridge right behind the top teeth (alveolar ridge), but when it comes before a /j/ (which is articulated further back in the mouth—at the hard palate), it's usually pronounced /t͡ʃ/. What happens here is that the /t/ is articulated further back in the mouth in anticipation of the following /j/, so it becomes /t͡ʃ/ i.e. they coalesce to /t͡ʃ/.
The following sounds often coalesce:
- /t/ and /j/ coalesce to /t͡ʃ/ (as in posthumous)
- /d/ and /j/ coalesce to /d͡ʒ/ (as in education)
- /s/ and /j/ coalesce to /ʃ/ (bless you is sometimes pronounced bleshoo)
- /z/ and /j/ coalesce to /ʒ/ (as in vision)
You might have noticed that in informal situations (mostly in chatting platforms), most people write contractions such as dontcha, whatcha, gotcha etc. These are the phonetic spellings of the assimilated forms.
- dontcha → don't + you
- whatcha → what + you
- gotcha → got + you
- betcha → bet + you
Similarly, 'did you' is often pronounced as /dɪdʒjuː/ because /d/ and /j/ assimilate to /d͡ʒ/.
I wonder is this something specific to American accent?
No. It's not restricted to American English.
Best Answer
Phonemically1 -ing is always /ɪŋ/. The vowel phoneme2 is decided by linguists to be /ɪ/, though it can be realised in many different ways.
Phonetically3, however, it's realised as [iŋ] in some dialects of English (particularly American); that is to say, the vowel [ɪ] raises to [i] due to the effect of the following velar nasal (nasalisation).
According to The Origins and Development of the English Language by John Algeo (p26):
1. /phonemic transcriptions/ are language specific transcriptions i.e. the way dictionaries transcribe words. /They/ can have [many different realisations, depending on the speaker and accent]
2. ‘A phoneme is a mental image of all the various realisations of one and the same sound.’ (Donka Minkova) For example, the phoneme /t/ is a mental image of many realisations such as [t], [tʰ], [t̚], [ʔ] etc., in some dialects of English. By contrast, if you substituted say b for t it would change the meaning (cf. tall and ball) so we would say that /b/ and /t/ are two distinct ‘phonemes’ in English.
3. [phonetic transcriptions] transcribe actual speech sounds i.e how people speak