But the Y2038 problem will be giving us headaches long, long before 2038 arrives.
I& #39;d like to tell you a story about this.
                        
                        
                        One of my clients is responsible for several of the world& #39;s top 100 pension funds.
They had a nightly batch job that computed the required contributions, made from projections 20 years into the future.
It crashed on January 19, 2018 â 20 years before Y2038.
                    
                                    
                    They had a nightly batch job that computed the required contributions, made from projections 20 years into the future.
It crashed on January 19, 2018 â 20 years before Y2038.
                        
                        
                        No one knew what was wrong at first.
This batch job had never, ever crashed before, as far as anyone remembered or had logs for.
The person who originally wrote it had been dead for at least 15 years, and in any case hadn& #39;t been employed by the firm for decades.
                    
                                    
                    This batch job had never, ever crashed before, as far as anyone remembered or had logs for.
The person who originally wrote it had been dead for at least 15 years, and in any case hadn& #39;t been employed by the firm for decades.
                        
                        
                        The program was not that big, maybe a few hundred lines.
But it was fairly impenetrable â written in a style that favored computational efficiency over human readability.
And of course, there were zero tests.
                    
                                    
                    But it was fairly impenetrable â written in a style that favored computational efficiency over human readability.
And of course, there were zero tests.
                        
                        
                        As luck would have it, a change in the orchestration of the scripts that ran in this environment had been pushed the day before.
This was believed to be the culprit. Engineering rolled things back to the previous release.
Unfortunately, this made the problem worse.
                    
                                    
                    This was believed to be the culprit. Engineering rolled things back to the previous release.
Unfortunately, this made the problem worse.
                        
                        
                        You see, the program& #39;s purpose was to compute certain contribution rates for certain kinds of pension funds.
It did this by writing out a big CSV file. The results of this CSV file were inputs to other programs.
Those ran at various times each day.
                    
                                    
                    It did this by writing out a big CSV file. The results of this CSV file were inputs to other programs.
Those ran at various times each day.
                        
                        
                        Another program, the benefits distributor, was supposed to alert people when contributions weren& #39;t enough for projections.
It hadn& #39;t run yet when the initial problem occurred. But it did now.
                    
                                    
                    It hadn& #39;t run yet when the initial problem occurred. But it did now.
                        
                        
                        Noticing that there was no output from the first program since it had crashed, it treated this case as "all contributions are 0".
This, of course, was not what it should do.
But no one knew it behaved this way since, again, the first program had never crashed.
                    
                                    
                    This, of course, was not what it should do.
But no one knew it behaved this way since, again, the first program had never crashed.
                        
                        
                        This immediately caused a massive cascade of alert emails to the internal pension fund managers.
They promptly started flipping out, because one reason contributions might show up as insufficient is if projections think the economy is about to tank.
                    
                                    
                    They promptly started flipping out, because one reason contributions might show up as insufficient is if projections think the economy is about to tank.
                        
                        
                        The firm had recently moved to the cloud and I had been retained to architect the transition and make the migration go smoothly.
They& #39;d completed the work months before. I got an unexpected text from the CIO:
                    
                                    
                    They& #39;d completed the work months before. I got an unexpected text from the CIO:
                        
                        
                        S1X is their word for "worse than severity 1 because it& #39;s cascading *other* unrelated parts of the business". 
There had only been one other S1X in twelve months.
                    
                                    
                    There had only been one other S1X in twelve months.
                        
                        
                        I got onsite late that night. We eventually diagnosed the issue by firing up an environment and isolating the script so that only it was running.
The problem immediately became more obvious; there was a helpful error message that pointed to the problematic part.
                    
                                    
                    The problem immediately became more obvious; there was a helpful error message that pointed to the problematic part.
                        
                        
                        We were able to resolve the issue by hotpatching the script.
But by then, substantive damage had already been done because contributions hadn& #39;t been processed that day.
It cost about $1.7M to manually catch up over the next two weeks.
                    
                                    
                    But by then, substantive damage had already been done because contributions hadn& #39;t been processed that day.
It cost about $1.7M to manually catch up over the next two weeks.
                        
                        
                        The moral of the story is that Y2038 isn& #39;t "coming".
It& #39;s *already here*. Fix your stuff. https://abs.twimg.com/emoji/v2/... draggable="false" alt="âšď¸" title="Square for stop" aria-label="Emoji: Square for stop">
https://abs.twimg.com/emoji/v2/... draggable="false" alt="âšď¸" title="Square for stop" aria-label="Emoji: Square for stop">
                        
                        
                        
                        
                                                
                    
                    
                                    
                    It& #39;s *already here*. Fix your stuff.
                        
                        
                        Postscript: there& #39;s lots more that I think would be interesting to say on this matter that won& #39;t fit in a tweet.
If you& #39;re looking for speakers at your next conference on this topic, I& #39;d be glad to expound further. I don& #39;t want to be cleaning up more Y2038 messes! https://abs.twimg.com/emoji/v2/... draggable="false" alt="đ" title="Smiling face with open mouth and smiling eyes" aria-label="Emoji: Smiling face with open mouth and smiling eyes">
https://abs.twimg.com/emoji/v2/... draggable="false" alt="đ" title="Smiling face with open mouth and smiling eyes" aria-label="Emoji: Smiling face with open mouth and smiling eyes">
                        
                        
                        
                        
                                                
                    
                    
                
                If you& #39;re looking for speakers at your next conference on this topic, I& #39;d be glad to expound further. I don& #39;t want to be cleaning up more Y2038 messes!
 
                         Read on Twitter
Read on Twitter 
                             
                                     
                                    