Domo’s Cloud Amplifier is altering the best way folks can pull collectively information from totally different programs, to allow them to make an actual impression with much less problem. Cloud Amplifier works with the information infrastructure you already use, making it less complicated to remodel, visualize, and transfer your information round.
This software also can assist lower your expenses by making these duties run smoother in your cloud warehouse. Let’s speak about how Cloud Amplifier may have an effect on your cloud warehouse prices, plus methods for optimizing them.
How cloud warehouse suppliers sometimes cost
Cloud warehouse suppliers cost based mostly on the variety of compute clusters, often known as digital warehouses, used to course of information queries. Every cluster is allotted a certain quantity of CPU energy, reminiscence, and short-term storage for working queries.
These clusters are designed to function for minimal set durations and have a set idle time. In different phrases, if no queries are working, they’ll robotically shut down after a sure time. This setup helps decide the price of utilizing cloud warehouse providers.
Only a single question can get a compute cluster working, which prices cash. That’s why database directors and information engineers preserve a detailed eye on when and the way queries are made. They need to forestall spending cash on pointless compute clusters and discover alternatives the place fewer assets may get the identical job performed. This cautious monitoring helps keep away from pointless prices and retains every little thing working effectively.
Case Research with Snowflake
Activity: Two queries that every take about 5 seconds
State of affairs 1: Twin warehouses
Let’s think about utilizing Snowflake with two queries, every taking about 5 seconds. If every question runs by itself extra-small warehouse (i.e., compute cluster) and people warehouses flip off after being idle for 60 seconds, you find yourself paying for a bit extra time than you may anticipate. On this case, since there have been no different queries to maintain them busy, every warehouse ran for 65 seconds whole. Since two separate warehouses have been used, which means you paid for 130 seconds of compute time altogether.
State of affairs 2: Single warehouse
On this second situation, think about working the identical two queries, however this time utilizing only one digital warehouse. Since this warehouse has sufficient CPU, reminiscence, and storage to deal with each queries on the identical time, they’re processed collectively. This setup solely makes use of 65 seconds of warehouse time for each queries, successfully chopping the price in half in comparison with utilizing two separate warehouses.
Selecting your warehouse technique: Price allocation vs price financial savings
So, why would you need to have a number of compute clusters working on the identical time? When managing information, utilizing a number of compute clusters could make sense for a couple of causes:
- Warehouses can function wonderful proxies for price facilities and finances allocation.
- You’ll have totally different swimming pools of compute clusters for various kinds of jobs.
- You’ll have caps that restrict the period of time sure compute clusters can run.
We now have a tradeoff to think about. If your organization’s technique is primarily to cut back prices wherever potential, it’s good to make use of clusters which might be already working. You’re already paying for the CPU, reminiscence, and storage when a compute cluster is working. Why not make full use of them? This manner, you maximize what you’re already investing in.
In case your technique prioritizes having a transparent understanding of prices for particular tasks or departments, you could select to arrange separate compute clusters tailor-made to those wants.
This strategy improves your visibility into expenditures, aligning spending with the areas you’re monitoring intently. Nonetheless, the danger is working a number of compute clusters that aren’t totally utilized, working beneath their capability—an inefficient use of assets.
So, what do I like to recommend? Usually, it’s more cost effective to make the most of present compute clusters as a substitute of organising new ones particularly for Cloud Amplifier. Whereas there could be good causes to create devoted clusters for Cloud Amplifier connections, these can usually result in greater prices.
Alternatively, reusing present clusters can present the identical advantages at a diminished price, making Cloud Amplifier a extra economical selection with out compromising on worth.
Optimizing with Cloud Amplifier
Cloud Amplifier freshness checks and TTL cache
Understanding how warehouses cost for queries, now you can be extra conscious about utilizing Domo’s information freshness checks to steadiness the necessity for probably the most up-to-date information. By default for Snowflake, Cloud Amplifier will ship a question that appears one thing like this:

A single question will likely be made for all tables in a schema. If in case you have registered DataSets in Domo for tables unfold out throughout 10 totally different schemas, you will note 10 freshness test queries (one for every schema).
Most Cloud Amplifier engines will do an analogous question to test for information freshness (although BigQuery makes use of an API, and no question to info.schema is critical). This information freshness question would require {that a} compute cluster begin. The outcomes of this freshness test question are used for a number of operations:
- Figuring out whether or not an ETL must execute now that new information is accessible
- Evaluating for any information set alerts now that new information is accessible
- Figuring out whether or not Domo wants to question the cloud warehouse subsequent time information is requested (i.e., a card in Domo is seen) or if that very same information already exists in Domo’s cache.
NOTE: Domo caches the outcomes of particular person queries, not the whole information desk. The cache TTL is ready to fifteen minutes by default
Compute cluster prices from freshness checks
On the time of scripting this weblog, these freshness test queries will execute each quarter-hour, and normally require a compute cluster to be working within the cloud warehouse (once more, BigQuery being an exception). If a compute cluster isn’t already working, a compute cluster will likely be initiated. This then represents a possible price to your corporation, relying on whether or not or not you have already got a compute cluster working.
So, think about rigorously how usually your information is already updating. Setting the freshness test interval to be extra frequent than your information’s replace schedule may result in pointless compute cluster prices.
One other consideration is whether or not you need the freshness test queries to run at off-hours when folks aren’t essentially viewing or utilizing the information. For instance, persevering with to carry out freshness test queries at night-time hours when folks aren’t sometimes Domo can result in hidden compute cluster prices that add up shortly.
Navigating Cloud Amplifier and value concerns with Domo
Price concerns can appear daunting, however Domo is right here to help you with the correct instruments to deal with these discussions successfully. We’re dedicated to helping you thru this course of and can preserve offering a sequence of upcoming weblog posts to information you.
If in case you have particular wants or questions, don’t hesitate to succeed in out to your Domo account workforce to discover the main points of your use case.