Atlassian twice shunned AWS Graviton CPUs, but now runs Jira and Confluence on them
(2025/11/13)
- Reference: 1762996917
- News link: https://www.theregister.co.uk/2025/11/13/atlassian_aws_graviton_migration/
Atlassian twice marked Amazon Web Services’ Graviton CPUs off-limits for production purposes, but recently relented and now uses the processors to power thousands of server instances that run its Jira and Confluence products. So what changed?
A recent [1]post by Atlassian principal site reliability engineers Paulo Almeida and Jakub Gutkowski, plus software engineer Jędrzej Lepa, reveals that Atlassian was interested in Amazon’s CPUs after the 2020 release of Graviton 2.
“In the tech industry there are only a couple of things that can make engineers try something new. Either it’s because we can do something faster or because we can do something cheaper,” the Atlassian trio wrote. “When only one of those two benefits presents itself, we often resort to long trade-off conversations because we not only have to factor in the immediate gains but also the invisible costs of implementation, support, tooling ecosystem, edge cases, and so on.”
Atlassian noted AWS’s claims that Graviton would be both faster and cheaper, which made the processors worthy of investigation – but its early tests did not support Amazon’s assertions.
“There have been previous investigations into the feasibility of migrating Jira to Graviton 2 and Graviton 3 at Atlassian,” the trio wrote. Those efforts produced results that showed Graviton processors were not superior, but didn’t unearth a smoking gun that showed exactly how Amazon’s CPUs slowed Atlassian’s code.
“From a performance engineer’s view, this is a problem,” Atlassian’s post states. “When issues appear at specific parts of our code, at least you know where to investigate next. But when problems span multiple endpoints and areas of your code, pinpointing a fix is like finding a needle in a haystack.”
Atlassian doesn’t have unlimited resources, so after its site reliability engineering team could not reach consensus about why Graviton wasn’t delivering, the company parked it.
“The sentiment for a while across the SRE team was ‘Graviton is off limits for now; should we ever want to try it again, we need an unquestionable and reproducible evidence that explains why Jira and Confluence couldn’t run before with the desired performance (and what makes it possible now)’,” the post explains.
[6]Atlassian migrated 4 million Postgres databases to shrink AWS bill
[7]Atlassian's Trello redesign may be 'worst in tech history' say frustrated users
[8]Atlassian's move to cloud-only means customers face integration issues and more
[9]Atlassian drops $1B on company that helps measure dev productivity
In September 2024, AWS switched on the EC2 instances running Graviton 4 and again claimed they delivered superior performance. Atlassian decided to take another look and this time devised tests it hoped would yield more granular data about the performance of Amazon’s chips with its code.
That effort produced insights into the way Graviton processors use their L3 cache.
“This finding was valuable because it helped us move beyond the biased ‘Graviton-doesn’t-have-enough-horse-power’ knee-jerk reaction and pointed us to something more tangible, consistent and reproducible across multiple tests,” Atlassian’s authors wrote. It also suggested ways that Atlassian could optimize its use of the Java virtual machine (JVM) so that Graviton’s caching quirks didn’t negatively impact performance.
Those JVM tweaks produced improved performance.
“In theory all we would to have to do is to change the instance type to a Graviton-based equivalent and happy days, right?” the post states, before answering that question with an observation that when AWS launches a new instance type, the incidence of Insufficient Capacity Errors (ICE) increases.
“Atlassian has worked with the cloud long enough to realise that what might be an exception for smaller customers is common for us. ICE errors occur between 10,000 to 15,000 times per hour worldwide for Atlassian,” the post states.
The company therefore devised a scheme that sees it use EC2 instances powered by Graviton 4 processors for user-facing tasks and older Gravitons for other workloads, while ensuring workloads can fall back to x86 instances when required.
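A tiered scheme of this kind can be pictured as an ordered preference list per workload, with each capacity error pushing the request to the next option. The Python sketch below is only an illustration under stated assumptions: the instance type names, the `PREFERENCES` table, the `InsufficientCapacity` exception and the `launch` callback are all hypothetical stand-ins, not Atlassian's implementation or an AWS API.

```python
from typing import Callable, Dict, List


class InsufficientCapacity(Exception):
    """Hypothetical stand-in for an Insufficient Capacity Error (ICE)."""


# Assumed preference order per workload: newest Graviton first for
# user-facing work, older Gravitons for the rest, x86 as the last resort.
# The instance type names are illustrative choices only.
PREFERENCES: Dict[str, List[str]] = {
    "user_facing": ["m8g.4xlarge", "m7g.4xlarge", "m7i.4xlarge"],
    "background": ["m7g.4xlarge", "m6g.4xlarge", "m7i.4xlarge"],
}


def launch_with_fallback(
    workload: str,
    launch: Callable[[str], None],
    preferences: Dict[str, List[str]] = PREFERENCES,
) -> str:
    """Try each preferred instance type in order; return the one that launched."""
    for instance_type in preferences[workload]:
        try:
            launch(instance_type)
            return instance_type
        except InsufficientCapacity:
            # No capacity for this type right now, so try the next preference.
            continue
    raise InsufficientCapacity(f"no capacity for workload {workload!r}")


def make_fake_launcher(exhausted: set) -> Callable[[str], None]:
    """Build a fake launcher that raises an ICE for the given instance types."""
    def launch(instance_type: str) -> None:
        if instance_type in exhausted:
            raise InsufficientCapacity(instance_type)
    return launch


if __name__ == "__main__":
    # Simulate both Graviton options for user-facing work being exhausted.
    launcher = make_fake_launcher({"m8g.4xlarge", "m7g.4xlarge"})
    print(launch_with_fallback("user_facing", launcher))
```

Keeping the order in a data table rather than in code means a fleet could be re-tiered without touching the launch logic, which matters when capacity errors are routine rather than exceptional.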
Atlassian has since migrated over 3,000 Jira and Confluence instances to Graviton, and has chalked up “around 9.8 percent savings across the board, while improving the experience for our customers with ... meaningfully lower latencies.”
The post says Atlassian’s cost to operate some of its cloudy infrastructure has fallen 25 percent, but achieving that on all systems “will require a bit of reorganisation of our fleet to unlock.”
Atlassian users will probably hope the company gets that re-org done quickly, as the company hiked prices in October and has promised to [11]start charging for its Rovo AI at an unspecified future point. ®
[1] https://www.atlassian.com/blog/atlassian-engineering/migrating-the-jira-and-confluence-applications-to-aws-graviton
[6] https://www.theregister.com/2025/07/07/asia_tech_news_in_brief/
[7] https://www.theregister.com/2025/08/06/trello_redesign_as_bad_as/
[8] https://www.theregister.com/2025/09/09/atlassian_will_go_cloudonly_customers/
[9] https://www.theregister.com/2025/09/18/atlassian_dx_purchase/
[11] https://www.theregister.com/2025/04/10/atlassian_rovo_free_teams_news/
Gasps
I thought Jira runs on morning gasps:
"Where is my ticket?"
"Which ticket you are working on?"
"Can you move your ticket, please?"
"Is this a child ticket?"
"Which board are you on?"