Our old duct-taped backend got us to over 1,000 customers before we hit scaling issues: servers started running out of memory, the job queue would get backed up, and customers were waiting longer and longer for videos to finish. This was something we put off for as long as we could.
We spent two months rewriting our entire backend and now we're really starting to see the benefits. Our AWS bill decreased by more than 50%, video wait times dropped by several minutes, and the system is much more robust with increased logging visibility and retry logic.