MariaDB.org-Planet-Feed

MariaDB Enterprise Server 10.6.17-13 maintenance release

2024-04-24T20:24:23+00:00

We are pleased to inform you that a new maintenance release of MariaDB Enterprise Server 10.6 is available. This is an out-of-band release which includes a fix for a regression introduced with MariaDB Enterprise Server 10.6.17-12. Users of MariaDB Enterprise Server 10.6.17-12 are encouraged to upgrade to the latest version 10.6.17-13. Download Now MariaDB Enterprise Server is an enhanced…

Source

The post MariaDB Enterprise Server 10.6.17-13 maintenance release appeared first on MariaDB.org.

Bringing Percona Experts to a City Near You

2024-04-24T13:14:55+00:00

Percona.connect, a series of free events hosted by Percona database performance experts, is coming to a city near you! This amazing learning opportunity spans six cities — across two continents — and includes educational sessions, customer testimonials, networking activities, and more. Don’t miss your chance to talk with technical evangelists, MySQL, PostgreSQL, and MongoDB experts, Percona […]

The post Bringing Percona Experts to a City Near You appeared first on MariaDB.org.

Short note re Progress and NuSphere

2024-04-23T17:45:09+00:00

Progress Software on April 19 said more about “considering” an offer for MariaDB plc (the company not the foundation).

They own NuSphere which had a dispute with MySQL AB which was settled in 2002. My happy history as a MySQL employee biases me but I thought that NuSphere was not acting angelically.

I think it won’t happen.

The post Short note re Progress and NuSphere appeared first on MariaDB.org.

Why MariaDB Is “Better” Than MySQL

2024-04-23T15:36:53+00:00

Apples or oranges?Tea or coffee?Books or eBooks?Each of these comparisons has very similar features and serves many of the same purposes, but in the end, they are different choices people make. Do you know what else belongs on this list?MariaDB or MySQL?It’s time we discuss the age-old debate of MariaDB versus MySQL and see if […]

The post Why MariaDB Is “Better” Than MySQL appeared first on MariaDB.org.

The Insert Benchmark: MariaDB, MySQL, small server, cached workload, some concurrency

2024-04-21T22:43:00+00:00

This post has results for the Insert Benchmark on a small server with a cached workload. The goal is to compare MariaDB and MySQL.

This work was done by Small Datum LLC and sponsored by the MariaDB Foundation.

The workload here has some concurrency (4 clients) and the database is cached. The results might be different when the workload is IO-bound or has more concurrency. Results were recently shared for a workload with low concurrency (1 client),

The results here are similar to the results on the low-concurrency benchmark.

tl;dr

Modern MariaDB (11.4.1) is faster than modern MySQL (8.0.36) on all benchmark steps except for qr* (range query) and l.x (create index) where they have similar performance.
Modern MariaDB (11.4.1) was at most 12% slower than older MariaDB (10.2.44). MariaDB has done a great job of avoiding performance regressions over time.
There are significant performance regressions from older MySQL (5.6) to modern MySQL (8.0)
Configuring MariaDB 10.4 to make it closer to 10.5 (only one InnoDB buffer pool, only one redo log file) reduces throughput by ~5% on several benchmark steps

Build + Configuration

This report has results for InnoDB from:

MySQL – versions 5.6.51, 5.7.44 and 8.0.36
MariaDB – versions 10.2.44, 10.3.39, 10.4.33, 10.5.24, 10.6.17, 10.11.7, 11.4.1. Versions 10.2, 10.3, 10.4, 10.5, 10.6 and 10.11 are the most recent LTS releases and 11.4 will be the next LTS release.

All of the my.cnf files are here. I tried to use similar configurations across releases, but isn’t always possible. And even when it was possible I favor setting fewer entries especially for options where the default value changes between releases.

I started with the my.cnf.cz11a_bee config and then began to make small changes. For all configs I set these values to limit the size of the history list which also keeps the database from growing larger than expected. I rarely did this in the past.

innodb_max_purge_lag=500000

innodb_max_purge_lag_delay=1000000

Some of the changes were challenging when trying to make things comparable.

the InnoDB change buffer was removed in MariaDB 11.4.

I disable it in all my.cnf files for all MariaDB versions except for the my.cnf.cz11d and my.cnf.cz11d1 configs.
I don’t disable it for the MySQL configs named my.cnf.cz11[abc]_bee but I do disable it for the my.cnf.cz11d_bee config used by MySQL. The result is that for MariaDB the my.cnf.cz11d_bee config enables the change buffer while for MySQL it disables it. Sorry for the confusion.

innodb_buffer_pool_instances was removed in MariaDB 10.5 (assume it is =1).

I don’t set it to =1 in the my.cnf.cz[abc]_bee configs for MariaDB 10.2, 10.3, 10.4

innodb_flush_method was removed in MariaDB 11.4 and there is a new way to configure this.

In 11.4.1 there is an equivalent of =O_DIRECT but not of =O_DIRECT_NO_FSYNC

For MariaDB the typical my.cnf files were:

my.cnf.cz11a_bee – uses innodb_flush_method=O_DIRECT_NO_FSYNC
Two configs that are only used with MariaDB 10.4 to make it closer to 10.5+

my.cnf.cz11abpi1_bee – like my.cnf.cz11a_bee except sets innodb_buffer_pool_instances and innodb_page_cleaners to =1. This config is used for makes MairaDB 10.4
my.cnf.cz11aredo1_bee – like my.cnf.cz11a_bee except uses one big InnoDB redo log instead of 10 smaller ones

my.cnf.cz11b_bee – uses innodb_flush_method=O_DIRECT
my.cnf.cz11c_bee – uses innodb_flush_method=fsync
my.cnf.cz11d_bee – uses innodb_flush_method=O_DIRECT_NO_FSYNC and enables the InnoDB change buffer

For MySQL the typical my.cnf files were:

my.cnf.cz11a_bee – uses innodb_flush_method=O_DIRECT_NO_FSYNC
my.cnf.cz11b_bee – uses innodb_flush_method=O_DIRECT
my.cnf.cz11c_bee – uses innodb_flush_method=fsync
my.cnf.cz11d_bee – uses innodb_flush_method=O_DIRECT_NO_FSYNC and disables the InnoDB change buffer. Note the my.cnf.cz11[abc]_bee configs for MySQL enabled it. This is the opposite of what is done for MariaDB.

The Benchmark

The benchmark is run with 4 clients (some concurrency), a cached workload and 4 tables (table per client). It is explained here. The initial load (l.i0) inserts 30M rows and the database fits in memory.

The test server was named v4 here and is a Beelink SER4. It has 8 cores, 16G RAM, Ubuntu 22.04 and XFS using 1 m.2 device.

The benchmark steps are:

l.i0

insert 8 million rows per table in PK order (32M rows in total). Each table has a PK index but no secondary indexes. There is one connection per client.

create 3 secondary indexes per table. There is one connection per client.

l.i1

use 2 connections/client. One inserts 40M rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.

l.i2

like l.i1 but each transaction modifies 5 rows (small transactions) and 10M rows are inserted and deleted.
Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow. The value of X is a function of the table size.

qr100

use 3 connections/client. One does range queries and performance is reported for this. The second does does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for 1800 seconds. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.

qp100

like qr100 except uses point queries on the PK index

qr500

like qr100 but the insert and delete rates are increased from 100/s to 500/s

qp500

like qp100 but the insert and delete rates are increased from 100/s to 500/s

qr1000

like qr100 but the insert and delete rates are increased from 100/s to 1000/s

qp1000

like qp100 but the insert and delete rates are increased from 100/s to 1000/s

Results

The performance reports are here for:

MariaDB 10.11, MariaDB 11.4, MySQL 8.0 release+config variations
most MariaDB release+config variations
MariaDB 10.4 release+config variations
MariaDB 10.11 release+config variations
MariaDB 11.4 release+config variations
most MySQL release+config variations
all DBMS

The summary in each performance report has 3 tables. The first shows absolute throughput by DBMS tested X benchmark step. The second has throughput relative to the version from the first row of the table. The third shows the background insert rate for benchmark steps with background inserts and all systems sustained the target rates. The second table makes it easy to see how performance changes over time. The third table makes it easy to see which DBMS+configs failed to meet the SLA.

Below I use relative QPS to explain how performance changes. It is: (QPS for $me / QPS for $base) where $me is my version and $base is the version of the base case. When relative QPS is > 1.0 then performance improved over time. When it is < 1.0 then there are regressions. The Q in relative QPS measures:

insert/s for l.i0, l.i1, l.i2
indexed rows/s for l.x
range queries/s for qr100, qr500, qr1000
point queries/s for qp100, qp500, qp1000

Below I use colors to highlight the relative QPS values with red for <= 0.95, green for >= 1.05 and grey for values between 0.95 and 1.05.

Results: MariaDB vs MySQL

Modern MariaDB (11.4.1) is faster than modern MySQL (8.0.36) on all benchmark steps except for qr* (range query) and l.x (create index) where they have similar performance. This matches the low concurrency results.

From the summary:

The base case is MariaDB 10.11.7 with the cz11a_bee config (ma101107_rel.cz11a_bee). It is compared with

MariaDB 11.4.1 with the cz11b_bee config (ma110401_rel.cz11b_bee)
MySQL 8.0.36 with the cz11a_bee config (my8036_rel.cz11a_bee)

Relative throughput per benchmark step

l.i0

relative QPS is 0.99 in MariaDB 11.4.1
relative QPS is 0.75 in MySQL 8.0.36

l.x – I ignore this for now
l.i1, l.i2

relative QPS is 0.88, 0.98 in MariaDB 11.4.1
relative QPS is 0.52, 0.54 in MySQL 8.0.36

qr100, qr500, qr1000

relative QPS is 0.99, 1.01, 1.00 in MariaDB 11.4.1
relative QPS is 1.08, 1.05, 1.06 in MySQL 8.0.36

qp100, qp500, qp1000

relative QPS is 0.98, 1.00, 1.00 in MariaDB 11.4.1
relative QPS is 0.77, 0.79, 0.77 in MySQL 8.0.36

Results: MariaDB

Modern MariaDB (11.4.1) was at most 12% slower than older MariaDB (10.2.44). This matches the low concurrency results. MariaDB has done a great job of avoiding performance regressions over time.

From the summary:

The base case is MariaDB 10.2.44 with the cz11a_bee config (ma100244_rel.cz11a_bee). It is compared with more recent LTS releases from 10.3, 10.4, 10.5, 10.6, 10.11 and 11.4.
Throughput per benchmark step for 11.4.1 relative to 10.2.44

l.i0

relative QPS is 0.88 in MariaDB 11.4.1

l.x – I ignore this for now
l.i1, l.i2

relative QPS is 0.99, 1.09 in MariaDB 11.4.1

qr100, qr500, qr1000

relative QPS is 0.88, 0.93, 0.92 in MariaDB 11.4.1

qp100, qp500, qp1000

relative QPS is 0.93, 0.98, 1.00 in MariaDB 11.4.1

Results: MariaDB 10.4

From the summary:

The base case is MariaDB 10.4.33 with the cz11a_bee config (ma100433_rel.cz11a_bee).
It is compared with MariaDB 10.4.33 using the cz11abpi1_bee and cz11aredo1 configs
MariaDB with the cz11abpi1_bee config does worse on the l.i1 and l.i2 benchmark steps

There might be more contention on the one buffer pool instance
From the metrics with the l.i1 benchmark step I see larger values for context switches per query (cspq) and CPU per query (cpupq). I also see a lower CPU utilization (cpups) and write back rate (wmbps is storage MB/s written)

MariaDB with the cz11aredo1_bee config does worse on the qp* benchmark steps

From the metrics with the qp* benchmark steps I see a larger value for CPU per query (cpupq)

Throughput per benchmark step relative to the base case

l.i0

relative QPS is 0.99 with cz11abpi1_bee
relative QPS is 1.01 with cz11aredo1_bee

l.x – I ignore this for now
l.i1, l.i2

relative QPS is 0.90, 0.94 with cz11abpi1_bee
relative QPS is 1.00, 1.00 with cz11aredo1_bee

qr100, qr500, qr1000

relative QPS is 1.00, 1.00, 1.01 with cz11abpi1_bee
relative QPS is 0.99, 0.98, 0.99 with cz11aredo1_bee

qp100, qp500, qp1000

relative QPS is 0.99, 1.01, 1.00 with cz11abpi1_bee
relative QPS is 0.95, 0.95, 0.95 with cz11aredo1_bee

Results: MariaDB 10.11

From the summary:

The base case is MariaDB 10.11.7 with the cz11a_bee config (ma101107_rel.cz11a_bee) and 10.11.7 releases with other configs are compared to it.
The results here are similar to the low-concurrency results except that the results here for the cz11b_bee and cz11c_bee configs are much worse for several of the write-heavy steps than they are in the low-concurrency test.

Results: MariaDB 11.4

From the summary:

The base case is MariaDB 11.4.1 with the cz11b_bee config (ma110401.cz11b_bee) and 11.4.1 releases with the cz11c_bee config is compared to it. Note that 11.4.1 does not support the equivalent of O_DIRECT_NO_FSYNC for innodb_flush_method.
For the cz11c_bee config

Performance for the write heavy steps (l.i0, l.i1, l.i2) is ~10% worse than the base case. This issue doesn’t repeat on the low-concurrency results.
Performance for qp100 is ~7% worse than the base case. This is similar to the low-concurrency results.

Results: MySQL

There are significant performance regressions from MySQL 5.6 to 8.0 and the results here are similar to the low-concurrency results.

From the summary:

The base case is MySQL 5.6.51 with the cz11a_bee config (my5651_rel.cz11a_bee) and it is compared to MySQL 5.7.44 and 8.0.36.
Relative throughput per benchmark step

l.i0

relative QPS is 0.84 in MySQL 5.7.44
relative QPS is 0.60 in MySQL 8.0.36

l.x – I ignore this for now
l.i1, l.i2

relative QPS is 1.33, 1.17 in MySQL 5.7.44
relative QPS is 0.78, 0.72 in MySQL 8.0.36

qr100, qr500, qr1000

relative QPS is 0.79, 0.80, 0.80 in MySQL 5.7.44
relative QPS is 0.69, 0.68, 0.69 in MySQL 8.0.36

qp100, qp500, qp1000

relative QPS is 0.82, 0.86, 0.81 in MySQL 5.7.44
relative QPS is 0.64, 0.67, 0.64 in MySQL 8.0.36

The post The Insert Benchmark: MariaDB, MySQL, small server, cached workload, some concurrency appeared first on MariaDB.org.

Deploying Percona Everest on GCP with Kubectl for Windows 11 Users

2024-04-19T00:00:00+00:00

Welcome to this blog post! Today, our primary goal is to guide you through deploying Percona Everest on GCP using Kubectl, specifically for users on Windows 11. It’s been some time since I last used Windows, so this will be an excellent opportunity to do it from scratch.
Let me tell you a little bit about Percona Everest. You may have already heard it recently. It is the new open source tool launched by Percona and is already well-received by Kubernetes database users.

The post Deploying Percona Everest on GCP with Kubectl for Windows 11 Users appeared first on MariaDB.org.

Explain Hypothetical Scenario: The Application Rolls Back the Transaction in MySQL

2024-04-18T19:45:11+00:00

In a hypothetical scenario where an application rolls back a transaction in MySQL, especially when using the InnoDB storage engine, several key processes and implications come into play. This scenario can serve as an instructive example of how transaction control mechanisms work within MySQL to ensure data integrity and consistency. Here’s a breakdown of what happens and the potential effects:

1. Transaction Initiation and Changes

When a transaction is initiated in MySQL using InnoDB, any changes made to the data during the transaction are logged in the transaction log. These changes could include updates, inserts, or deletes affecting table rows. However, these changes are not immediately made permanent in the database; instead, they exist in a tentative state where they can be committed or rolled back.

2. Request for Rollback

A rollback might be triggered by several scenarios:

Explicit Command: The application issues a ROLLBACK; command because it encountered an error or a specific condition that the application logic determines as requiring a reversal of the changes.
Implicit Trigger: The database itself might trigger a rollback if it encounters an error during the transaction processing, such as integrity constraint violations, deadlock situations, or system errors.

3. Execution of Rollback

When the rollback is triggered:

Undo Logs: InnoDB utilizes undo logs that record information needed to revert the changes made by a transaction. These logs are crucial because they allow the database to reverse any modifications made during the transaction.
Reverting Changes: The database uses the undo logs to revert the affected database rows to their original state prior to the transaction.
Clearing Buffers: Any changes that were held in the buffer and not yet flushed to disk are cleared, ensuring that none of the uncommitted changes are written to the physical database files.

4. Release of Locks

Transactions typically involve locking mechanisms to maintain isolation levels and prevent data anomalies. During a rollback:

Lock Release: All locks held by the transaction are released. This release is important to prevent deadlocks and allow other transactions waiting on these locks to proceed.

5. Impact on System Performance

Resource Utilization: Rollbacks can be resource-intensive, especially if the transaction involved significant data modifications. The process of reading undo logs and restoring previous states consumes CPU and I/O resources.
Concurrency and Throughput: Frequent rollbacks might impact the overall throughput of the system. If transactions frequently need to be rolled back, it could indicate deeper issues in the application logic or database design that might need optimization.

6. Logging and Monitoring

Audit Trails: Most robust systems log transaction rollbacks for audit and diagnostic purposes. Monitoring these logs can help in identifying patterns or recurring issues that lead to rollbacks.
Performance Metrics: It’s also wise to monitor performance metrics around transaction handling, such as rollback rates, to gauge the health and efficiency of the transaction management system.

7. Potential Issues and Considerations

Data Consistency: While rollbacks are essential for maintaining data integrity, frequent rollbacks could point to issues in application logic, user input handling, or conflict resolution strategies in the database.
Optimization Opportunities: Analyzing the causes and contexts of frequent rollbacks might provide insights into opportunities for optimizing transaction processing, improving application responses, and enhancing user experience.

In conclusion, a rollback is a vital feature of transactional databases like MySQL’s InnoDB, designed to ensure data integrity by enabling the database to revert to a consistent state in case of errors or specific conditions dictated by the application logic. Understanding the mechanics and implications of this process is crucial for database administration and application development.

What happens to uncommitted transactions in MySQL if the server crashes after the update?

InnoDB Multi-Versioning (MVCC)

How fast growing undo logs impact MySQL performance?

InnoDB early lock release

The post Explain Hypothetical Scenario: The Application Rolls Back the Transaction in MySQL appeared first on The WebScale Database Infrastructure Operations Experts in PostgreSQL, MySQL, MariaDB and ClickHouse.

The post Explain Hypothetical Scenario: The Application Rolls Back the Transaction in MySQL appeared first on MariaDB.org.

Changes to managing turbo boost in Ubuntu 22.04 and Linux 6.5

2024-04-18T17:53:00+00:00

I often use HWE kernels with Ubuntu and currently use Ubuntu 22.04. Until recently that meant I ran Linux 6.2 but after a recent update I am now on Linux 6.5.

I am far from an expert on this topic and what I write here might just be notes to myself. Be wary of following my advice.

Disabling turbo boost yesterday

I have been disabling turbo boost for many years on my home test servers to reduce performance variance from hardware, especially as the weather gets warm because I don’t have a server room with AC. The problem with turbo boost on some of my servers was cyclical behavior:

CPU cools, turbo boost does its thing
benchmark runs faster
CPU gets hot
turbo boost stops doing its thing
benchmark runs slower
repeat

On my Intel servers I disable turbo boost via BIOS settings. On my AMD servers that used to be done via a script because I was using acpi-cpufreq: echo 0 > /sys/devices/system/cpu/cpufreq/boost

My goal is repeatable performance and I am willing to sacrifice peak HW performance to get that. Avoiding the cycle described above helps to achieve that. Alas this is a spectrum — I tolerate other things (CPU cache, database cache) that improve performance while adding variance. I assume that I want CPU frequency to stay within a narrow range. It isn’t clear that even when using acpi-cpufreq that I was getting a narrow range, but it did help.

From the Ryzen 7 7840HS CPU I am use on these servers the AMD specs state that the base speed is 3.8GHz and the max boost is up to 5.1GHz. With acpi-cpufreq the CPU cores can be in one of three frequency levels, and from cpupower frequency-info they are:

available frequency steps: 3.80 GHz, 2.20 GHz, 1.60 GHz

So even with turbo boost disabled (see the echo command above) there is still room for variance. But I don’t know enough to determine whether I need to do more tuning.

Disabling turbo boost today

After a recent update on Ubuntu 22.04 with HWE kernels I now run 6.5.0-27-generic and acpi-cpufreq has been replaced by amd-pstate. I am sure there are many benefits from this change, alas, it also brings complexity and confusion from users who now have server cooling problems (because things are running faster) and are trying to figure out how to fix them. Notes on setting up the server are here.

I noticed this change because with the the default (amd-pstate in active mode) this file doesn’t exist:

/sys/devices/system/cpu/cpufreq/boost

On a Ryzen 7 CPU I get the amd-pstate-epp driver in active mode. Output from /proc/cpuinfo and cpupower frequency-info from this state is below. Note that /sys/devices/system/cpu/cpufreq/boost doesn’t exist when in active mode. It does exist when in guided or passive mode. So I either need to switch to guided or passive mode or rollback to using the acpi-cpufreq driver. Which means I need to understand a bit more.

There is a lot of documentation for the amd-pstate driver. It isn’t meant for the casual user.

There is a big difference between acpi-cpufreq and amd-pstate and amd-pstate is the future but perhaps not today (for me). While with acpi-cpufreq and turbo boost disabled I should only get one of three CPU frequencies, I can get many more with amd-pstate. From cpupower frequency-info output

analyzing CPU 7:
driver: amd-pstate-epp
CPUs which run at the same hardware frequency: 7
CPUs which need to have their frequency coordinated by software: 7
maximum transition latency: Cannot determine or is not supported.
hardware limits: 400 MHz – 5.61 GHz
available cpufreq governors: performance powersave
current policy: frequency should be within 400 MHz and 5.61 GHz.
The governor “powersave” may decide which speed to use
within this range.
current CPU frequency: Unable to call hardware
current CPU frequency: 2.97 GHz (asserted by call to kernel)
boost state support:
Supported: yes
Active: no

For now I will just rollback to using the acpi-cpufreq driver while figuring this out and possibly waiting for Linux 6.6 to show up on Ubuntu 22.04. I am not sure how mature amd-pstate is, and I won’t get support for cpupower set –turbo-boost 1 until 6.6 arrives.

I now have this in /etc/default/grub:

GRUB_CMDLINE_LINUX_DEFAULT=”pcie_aspm=off nosmt amd_pstate=disable”

pcie_aspm=off is there to avoid correctable PCI errors (maybe Beelink BIOS needs an update)
nosmt disables hyperthreads because BIOS doesn’t have an option for that
amd_pstate=disable lets me use the acpi-cpufreq driver

CPU frequencies with acpi-cpufreq

This shows the CPU frequencies I get from an idle server with the acpi-cpufreq driver. Note that I mostly get only 3 values when boost is disabled (set to 0).

With /sys/devices/system/cpu/cpufreq/boost set to 0

current CPU frequency: 2.18 GHz (asserted by call to kernel)
current CPU frequency: 1.50 GHz (asserted by call to kernel)
current CPU frequency: 3.80 GHz (asserted by call to kernel)
current CPU frequency: 1.60 GHz (asserted by call to kernel)
current CPU frequency: 1.60 GHz (asserted by call to kernel)
current CPU frequency: 1.60 GHz (asserted by call to kernel)
current CPU frequency: 1.60 GHz (asserted by call to kernel)
current CPU frequency: 3.80 GHz (asserted by call to kernel)

With /sys/devices/system/cpu/cpufreq/boost set to 1

current CPU frequency: 1.60 GHz (asserted by call to kernel)
current CPU frequency: 2.04 GHz (asserted by call to kernel)
current CPU frequency: 2.11 GHz (asserted by call to kernel)
current CPU frequency: 1.60 GHz (asserted by call to kernel)
current CPU frequency: 1.60 GHz (asserted by call to kernel)
current CPU frequency: 1.57 GHz (asserted by call to kernel)
current CPU frequency: 1.60 GHz (asserted by call to kernel)
current CPU frequency: 3.21 GHz (asserted by call to kernel)

Appendix

Note that cpupower frequency-info only shows frequencies for one core, to see them all use cpupower -c all frequency-info.

Output from cpupower frequency-info with active mode

Output from cpupower frequency-info with guided mode

analyzing CPU 7:
driver: amd-pstate
CPUs which run at the same hardware frequency: 7
CPUs which need to have their frequency coordinated by software: 7
maximum transition latency: 20.0 us
hardware limits: 400 MHz – 5.61 GHz
available cpufreq governors: conservative ondemand userspace powersave performance schedutil
current policy: frequency should be within 400 MHz and 5.61 GHz.
The governor “schedutil” may decide which speed to use
within this range.
current CPU frequency: Unable to call hardware
current CPU frequency: 1.44 GHz (asserted by call to kernel)
boost state support:
Supported: yes
Active: yes
AMD PSTATE Highest Performance: 214. Maximum Frequency: 5.61 GHz.
AMD PSTATE Nominal Performance: 145. Nominal Frequency: 3.80 GHz.
AMD PSTATE Lowest Non-linear Performance: 42. Lowest Non-linear Frequency: 1.10 GHz.
AMD PSTATE Lowest Performance: 16. Lowest Frequency: 400 MHz.

Output from cpupower frequency-info with passive mode

analyzing CPU 7:
driver: amd-pstate
CPUs which run at the same hardware frequency: 7
CPUs which need to have their frequency coordinated by software: 7
maximum transition latency: 20.0 us
hardware limits: 400 MHz – 5.61 GHz
available cpufreq governors: conservative ondemand userspace powersave performance schedutil
current policy: frequency should be within 400 MHz and 5.61 GHz.
The governor “schedutil” may decide which speed to use
within this range.
current CPU frequency: Unable to call hardware
current CPU frequency: 2.74 GHz (asserted by call to kernel)
boost state support:
Supported: yes
Active: yes
AMD PSTATE Highest Performance: 214. Maximum Frequency: 5.61 GHz.
AMD PSTATE Nominal Performance: 145. Nominal Frequency: 3.80 GHz.
AMD PSTATE Lowest Non-linear Performance: 42. Lowest Non-linear Frequency: 1.10 GHz.
AMD PSTATE Lowest Performance: 16. Lowest Frequency: 400 MHz.

Output from /proc/cpuinfo

processor : 7
vendor_id : AuthenticAMD
cpu family : 25
model : 116
model name : AMD Ryzen 7 7840HS w/ Radeon 780M Graphics
stepping : 1
microcode : 0xa704103
cpu MHz : 3800.000
cache size : 1024 KB
physical id : 0
siblings : 8
core id : 7
cpu cores : 8
apicid : 14
initial apicid : 14
fpu : yes
fpu_exception : yes
cpuid level : 16
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good amd_lbr_v2 nopl nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba perfmon_v2 ibrs ibpb stibp ibrs_enhanced vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a avx512f avx512dq rdseed adx smap avx512ifma clflushopt clwb avx512cd sha_ni avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local avx512_bf16 clzero irperf xsaveerptr rdpru wbnoinvd cppc arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload vgif x2avic v_spec_ctrl vnmi avx512vbmi umip pku ospke avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni avx512_bitalg avx512_vpopcntdq rdpid overflow_recov succor smca flush_l1d
bugs : sysret_ss_attrs spectre_v1 spectre_v2 spec_store_bypass srso
bogomips : 7585.46
TLB size : 2560 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 48 bits physical, 48 bits virtual
power management: ts ttp tm hwpstate cpb eff_freq_ro [13] [14] [15]

The post Changes to managing turbo boost in Ubuntu 22.04 and Linux 6.5 appeared first on MariaDB.org.

How MariaDB and MySQL performance changed over releases

2024-04-17T17:02:54+00:00

Is performance important for you, along with the latest features and long-term support? Go with MariaDB 11.4. But don’t take our word for it. We asked well known benchmarking expert Mark Callaghan to check out a number of MariaDB and MySQL releases, hit them hard with a tool of his choice, and share his findings. …

Continue reading “How MariaDB and MySQL performance changed over releases”

The post How MariaDB and MySQL performance changed over releases appeared first on MariaDB.org.

Integrating Conflict-Free Replicated Data Types (CRDTs) with PostgreSQL for Distributed Systems

2024-04-17T09:31:49+00:00

Conflict-free Replicated Data Types (CRDTs) are advanced data structures that enable distributed databases to achieve high availability and strong eventual consistency without requiring complex conflict resolution procedures. Although PostgreSQL itself does not natively support CRDTs directly in its core functionality, understanding CRDTs can be valuable for those working with distributed systems and looking to integrate PostgreSQL within such environments.

What are CRDTs?

CRDTs are data types designed to simplify the development of resilient, scalable distributed systems. They allow multiple participants (nodes in a distributed system) to update data independently without central coordination, and then merge these updates in a way that resolves inconsistencies and conflicts predictably and automatically.

CRDTs typically come in two main types:

State-based CRDTs (CvRDTs):
- Concept: Each node maintains its own local state, which is updated independently.
- Synchronization: Nodes periodically synchronize their state by transmitting their entire state or a delta of the state to other nodes.
- Merging: States are merged using a function that is associative, commutative, and idempotent, ensuring all nodes converge towards the same state.
Operation-based CRDTs (CmRDTs):
- Concept: Instead of sharing state, nodes broadcast update operations to all other nodes.
- Requirements: Operations must be commutative (order-independent) and idempotent (re-applying an operation has no further effect).
- Reliability: This approach typically relies on a reliable broadcast mechanism to ensure delivery and ordering of operations.

Relevance to PostgreSQL

While PostgreSQL does not implement CRDTs within its internal mechanisms, understanding how to model data in ways compatible with CRDT principles can be beneficial, especially when using PostgreSQL as part of a distributed system. Here are some approaches and considerations for integrating CRDT concepts with PostgreSQL:

Application-Layer CRDTs:
- Implementation: Implement CRDT logic in the application layer using a programming language of your choice. PostgreSQL can be used to store the state of CRDTs or the operations if you are using an operation-based approach.
- Example Use Case: A collaborative application where users can edit shared documents or data concurrently.
Using Extensions and External Tools:
- Tools such as AntidoteDB and other databases designed for distributed systems can be integrated with PostgreSQL to manage CRDTs externally. Data can be replicated between PostgreSQL and these systems, leveraging their CRDT capabilities.
Custom Stored Procedures:
- Implement custom stored procedures in PostgreSQL that mimic CRDT operations. This could involve creating functions to merge divergent data states based on predefined rules.
Trigger-Based Replication:
- Use triggers in PostgreSQL to capture changes and replicate these changes across distributed instances in a way that could be aligned with CRDT operational behaviors.

Challenges and Considerations

Performance: Implementing CRDTs at the application layer or integrating external tools can introduce performance overhead, especially in high-latency networks.
Complexity: Designing and maintaining custom solutions for CRDT-like behaviors in PostgreSQL requires careful planning and robust testing.
Consistency: While CRDTs provide strong eventual consistency, they might not always guarantee immediate consistency, which can be a critical requirement for certain types of applications.

Conclusion

Although PostgreSQL does not natively support CRDTs, the principles underlying CRDTs can be applied through custom application logic or by integrating with specialized tools that support CRDTs. This approach allows PostgreSQL to be effectively used in distributed systems where data consistency and high availability are paramount. It’s important to carefully evaluate the trade-offs and ensure that the chosen approach aligns with the specific requirements of your distributed application or system.

Clustered Index Design considerations in PostgreSQL

Efficient Integration of PostgreSQL 16 with LDAP: Best Practices and Tips

MinervaDB Server for PostgreSQL

The post Integrating Conflict-Free Replicated Data Types (CRDTs) with PostgreSQL for Distributed Systems appeared first on The WebScale Database Infrastructure Operations Experts in PostgreSQL, MySQL, MariaDB and ClickHouse.

The post Integrating Conflict-Free Replicated Data Types (CRDTs) with PostgreSQL for Distributed Systems appeared first on MariaDB.org.

Release Roundup April 17, 2024

2024-04-17T00:00:00+00:00

Percona software releases and updates April 2 – April 17, 2024.
Percona is a leading provider of unbiased, performance-first, open source database solutions that allow organizations to easily, securely, and affordably maintain business agility, minimize risks, and stay competitive, free from vendor lock-in. Percona software is designed for peak performance, uncompromised security, limitless scalability, and disaster-proofed availability.
Our Release Roundups showcase the latest Percona software updates, tools, and features to help you manage and deploy our software.

The post Release Roundup April 17, 2024 appeared first on MariaDB.org.

MariaDB Completes Nutanix Ready Certification for AI and Data-Driven Hybrid and Multicloud Applications

2024-04-16T16:30:15+00:00

In the fast-paced world of enterprise databases, reliability, scalability, and performance are paramount. Cloud deployment flexibility is critically important to many companies and applications. While some applications are mandated for private cloud deployment, others are suitable for public cloud, and must even contend with multiple clouds. Others require scalable architectures for AI-driven…

Source

The post MariaDB Completes Nutanix Ready Certification for AI and Data-Driven Hybrid and Multicloud Applications appeared first on MariaDB.org.

A Guide to Better Understanding MySQL Charset Levels

2024-04-16T13:24:05+00:00

We usually receive and see some questions regarding the charset levels in MySQL, especially after the deprecation of utf8mb3 and the new default uf8mb4. If you understand how the charset works on MySQL but have some questions regarding this change, please check out Migrating to utf8mb4: Things to Consider by Sveta Smirnova.Some of the questions […]

The post A Guide to Better Understanding MySQL Charset Levels appeared first on MariaDB.org.

MariaDB Contribution Statistics, April 2024

2024-04-16T09:04:34+00:00

With the first quarter of 2024 out of the way, we can take a look at the contribution statistics for the last three months. For the Foundation, this has been a very busy quarter, we have had a few big events to prepare for and attend. …

Continue reading “MariaDB Contribution Statistics, April 2024”

The post MariaDB Contribution Statistics, April 2024 appeared first on MariaDB.org.

How do Bloom Indexes work in PostgreSQL?

2024-04-15T14:01:07+00:00

Bloom indexes in PostgreSQL are a specialized type of index that provides an efficient way to test whether an element is a member of a set. They are particularly useful for speeding up queries involving multiple columns where traditional b-tree indexes would be less efficient. This makes them ideal for situations where you want to check the presence of a combination of values across several columns without the need to index each combination explicitly.

How Bloom Indexes Work?

Bloom indexes use a probabilistic data structure called a Bloom filter. Here’s how they function in PostgreSQL:

Hash Functions: A Bloom filter uses multiple hash functions to map each element of a dataset to several positions in a fixed-size bit array. Each element is hashed several times, and the bits at the resulting positions in the bit array are set to 1.
Insertion: When adding a value to the index, PostgreSQL applies the hash functions used in the Bloom filter to the value and sets the bits at the calculated positions in the bit array to 1.
Querying: To check if a value is present in the set, PostgreSQL hashes the value with the same hash functions and checks the bit array. If all the bits at the calculated positions are 1, the value might be in the set; if any bit is 0, the value is definitely not in the set.
False Positives: One of the characteristics of Bloom filters is that they can yield false positives but not false negatives. This means a query on a Bloom index can incorrectly indicate the presence of a value (although it’s not there), but it will never fail to detect a value that is present.

Use in PostgreSQL

In PostgreSQL, you can create a Bloom index using the bloom access method, which needs to be installed from the bloomextension if not already available:

CREATE EXTENSION bloom;

To create a Bloom index on a table, specify the columns you want to include:

CREATE INDEX ON my_table USING bloom (column1, column2, column3);

You can also tune the Bloom index by setting parameters like the length of the bit array (length) and the number of hash functions (colwidth). These parameters balance between the space used by the index and the accuracy (likelihood of false positives).

Advantages of Bloom Indexes

Space Efficiency: Bloom indexes are much more space-efficient than standard indexes, especially for composite indexing of multiple columns.
Performance: They can greatly improve the performance of queries that test for the presence of multiple attribute combinations, as they reduce the need for multiple or composite b-tree indexes.
Flexibility: They support indexing on several columns together without the need to create individual indexes on each combination of columns.

Limitations

False Positives: The possibility of false positives means that while Bloom indexes can speed up query performance, they may require additional filtering after the index lookup to ensure accuracy.
Write Cost: Like all indexes, maintaining a Bloom index has a cost in terms of data modification operations. Each insert, update, or delete in the table may require updates to the Bloom index.

Conclusion

Bloom indexes are a powerful feature in PostgreSQL for scenarios involving quick membership testing across multiple columns. They are particularly useful for large datasets where traditional indexing strategies would consume too much space or degrade performance. However, it’s essential to understand their probabilistic nature and plan for handling false positives in application logic.

How do Bloom Filters Work in MyRocks?

RocksDB use case: Building ultra low-latency Mobile Advertising Network

How to identify strings that can be treated as numbers in PostgreSQL?

The post How do Bloom Indexes work in PostgreSQL? appeared first on The WebScale Database Infrastructure Operations Experts in PostgreSQL, MySQL, MariaDB and ClickHouse.

The post How do Bloom Indexes work in PostgreSQL? appeared first on MariaDB.org.

Why SELECT COUNT(*) FROM TABLE Is Sometimes Very Slow in MySQL or MariaDB

2024-04-15T13:02:26+00:00

If you have enough experience with MySQL, it is very possible that you stumbled upon an unusually slow SELECT COUNT(*) FROM TABLE; query execution, at least occasionally.Recently, I had a chance to investigate some of these cases closer, and it stunned me what huge differences there can be depending on the circumstance given the very […]

The post Why SELECT COUNT(*) FROM TABLE Is Sometimes Very Slow in MySQL or MariaDB appeared first on MariaDB.org.

Comprehensive Guide to Aggregate Functions in PostgreSQL

2024-04-14T18:09:27+00:00

PostgreSQL is equipped with a robust suite of statistical functions that are essential for performing detailed data analysis directly within the database. These functions allow users to calculate various statistical measures, such as averages, variances, standard deviations, and correlations, directly from stored data. This capability not only streamlines the analytical process but also enhances performance by leveraging PostgreSQL’s powerful query optimization. Whether you’re analyzing financial data, customer metrics, or scientific measurements, PostgreSQL’s statistical functions provide the tools necessary to derive meaningful insights without the need for additional statistical software. This integration of statistical analysis into the database engine supports a wide range of applications, from business intelligence and market research to scientific research and predictive analytics, making PostgreSQL an invaluable tool for data scientists and analysts alike, Here is a comprehensive list of the different types of aggregate functions available, categorized by their use cases:

1. General-Purpose Aggregates

COUNT(expression): Counts non-null values in a set.
SUM(expression): Sums up the values of the expression.
AVG(expression): Calculates the average of the expression values.
MIN(expression): Finds the minimum value of the expression.
MAX(expression): Finds the maximum value of the expression.

2. Statistical Aggregates

STDDEV(expression): Computes the standard deviation of the input values.
STDDEV_POP(expression): Computes the population standard deviation.
STDDEV_SAMP(expression): Computes the sample standard deviation.
VAR_POP(expression): Computes the population variance.
VAR_SAMP(expression): Computes the sample variance.
VARIANCE(expression): Alias for VAR_SAMP.
COVAR_POP(expression1, expression2): Calculates the population covariance between two expressions.
COVAR_SAMP(expression1, expression2): Calculates the sample covariance.
CORR(expression1, expression2): Calculates the correlation coefficient.

3. Ordered-Set Aggregates

MODE(): Returns the mode of a set of values (the value that appears most frequently).
PERCENTILE_CONT(fraction): Computes a percentile based on a continuous distribution of the column values.
PERCENTILE_DISC(fraction): Computes a percentile based on a discrete distribution of the column values.

4. Hypothetical-Set Aggregates

Similar to ordered-set aggregates but include a hypothetical element:
- RANK(expression)
- DENSE_RANK(expression)
- PERCENT_RANK(expression)
- CUME_DIST(expression)

5. Array Aggregates

ARRAY_AGG(expression): Aggregates values, including nulls, into an array.
STRING_AGG(expression, delimiter): Concatenates input values into a string, separated by a delimiter.

6. JSON Aggregates

JSON_AGG(expression): Aggregates values as a JSON array.
JSONB_AGG(expression): Aggregates values as a JSONB array.
JSON_OBJECT_AGG(key, value): Aggregates input as a JSON object.
JSONB_OBJECT_AGG(key, value): Aggregates input as a JSONB object.

7. XML Aggregates

XMLAGG(expression): Concatenates XML values.

8. Grouping Operations

GROUPING(expression): Identifies the level of aggregation, particularly in GROUP BY GROUPING SETS.

Specialized Aggregates

BIT_AND(expression): Performs a bitwise AND operation on all input values.
BIT_OR(expression): Performs a bitwise OR operation on all input values.
BOOL_AND(expression): Returns TRUE if all input values are TRUE.
BOOL_OR(expression): Returns TRUE if at least one input value is TRUE.

These aggregate functions are versatile tools for data analysis, allowing complex statistical, mathematical, and group-based operations to be performed directly within SQL queries, enhancing the analytical capabilities of PostgreSQL.

Exploring PostgreSQL: How Tables are Stored and Indexed for Optimal Performance

2024-04-14T09:55:47+00:00

PostgreSQL employs sophisticated mechanisms for storage and indexing of tables, leveraging its extensible architecture to optimize data retrieval and maintain data integrity. Understanding these mechanisms helps in optimizing database performance and ensuring efficient data management.

Storage in PostgreSQL

Table Storage:
- Heap Files: Every table in PostgreSQL is stored as a collection of unordered rows in one or more disk files, generally referred to as heap files. Each row within a heap file is uniquely identified by a tuple identifier (CTID) which is a combination of the block number within the file and the index of the tuple within that block.
- TOAST (The Oversized-Attribute Storage Technique): PostgreSQL uses TOAST to efficiently store large table rows by compressing and breaking them up into multiple physical rows. This happens transparently in the background, ensuring that large values such as long text or binary data do not hinder the performance of the database.
Page Structure:
- PostgreSQL stores data in blocks or pages, typically 8KB in size. Each page can contain one or more tuples (rows) of data, depending on their size.

Indexing in PostgreSQL

PostgreSQL provides several index types, each suitable for different kinds of queries and data patterns:

B-Tree Indexes:
- The default and most commonly used index type in PostgreSQL.
- Ideal for data retrieval involving equality and range queries.
- Maintains data in a sorted order, allowing for efficient reads and writes.
Hash Indexes:
- Best suited for equality comparisons.
- Uses a hash table mechanism where it maps keys to table rows based on a hash function.
GiST (Generalized Search Tree) Indexes:
- A flexible, balanced tree structure that can support various types of searches.
- Commonly used for indexing geometric data and full-text searches.
GIN (Generalized Inverted Indexes):
- Optimized for handling cases where the items to be indexed are composite values, making it ideal for full-text search over documents.
- Stores an index entry for each element or word within a document or array, effectively inverting a mapping from documents to keywords.
BRIN (Block Range Indexes):
- Designed for large tables in which certain properties, like sequentiality, allow for the grouping of consecutive rows.
- Stores the summary information about the values in blocks of table data, which allows for quickly skipping over blocks that do not contain relevant data.
SP-GiST (Space-Partitioned Generalized Search Tree):
- Supports partitioning of the search space into non-overlapping regions and is suitable for data that does not fit well into a B-tree structure, such as non-balanced trees.

How Indexes Work

When a query is executed, PostgreSQL uses the index to quickly locate data without scanning every row in a table, significantly speeding up the query.
Indexes are especially valuable in read-heavy databases but do introduce some overhead for write operations because each INSERT, UPDATE, or DELETE operation on a table needs to update the indexes as well.

Maintenance

Regular index and table maintenance is necessary to keep the database performing well. Operations like VACUUM and REINDEX help maintain and optimize storage and index efficiency.

In summary, PostgreSQL’s approach to table storage and indexing is designed to offer robust data handling capabilities tailored to a variety of data types and query needs. This flexibility ensures that PostgreSQL can be optimized for a wide array of applications, from simple web applications to complex analytical systems.

The Insert Benchmark: MariaDB, MySQL, small server, cached workload

2024-04-13T18:18:00+00:00

This post has results for the Insert Benchmark on a small server with a cached workload. The goal is to compare MariaDB and MySQL.

This work was done by Small Datum LLC and sponsored by the MariaDB Foundation.

The workload here has low concurrency and the database is cached. The results might be different when the workload is IO-bound or has more concurrency.

tl;dr

Modern MariaDB (11.4.1) is faster than modern MySQL (8.0.36) on all benchmark steps except for qr* (range query) and l.x (create index) where they have similar performance.
Modern MariaDB (11.4.1) was at most 15% slower than older MariaDB (10.2.44). It is nice to see that MariaDB has done a great job of avoiding performance regressions over time.
There are significant performance regressions from older MySQL (5.6) to modern MySQL (8.0)
Performance with innodb_flush_method set to =O_DIRECT_NO_FSYNC is better than with =O_DIRECT is better than =fsync.

Build + Configuration

This report has results for InnoDB from:

MySQL – versions 5.6.51, 5.7.44 and 8.0.36
MariaDB – versions 10.2.44, 10.3.39, 10.4.33, 10.5.24, 10.6.17, 10.11.7, 11.4.1. Versions 10.2, 10.3, 10.4, 10.5, 10.6 and 10.11 are the most recent LTS releases and 11.4 will be the next LTS release.

innodb_max_purge_lag=500000

innodb_max_purge_lag_delay=1000000

Some of the changes were challenging when trying to make things comparable.

the InnoDB change buffer was removed in MariaDB 11.4.

I disable it in all my.cnf files for all MariaDB versions except for the my.cnf.cz11d and my.cnf.cz11d1 configs.
I don’t disable it for the MySQL configs named my.cnf.cz11[abc]_bee but I do disable it for the my.cnf.cz11d_bee config used by MySQL. The result is that for MariaDB the my.cnf.cz11d_bee config enables the change buffer while for MySQL it disables it. Sorry for the confusion.

innodb_buffer_pool_instances was removed in 10.5 (assume it is =1).

I don’t set it to =1 in the my.cnf.cz[abc]_bee configs for MariaDB 10.2, 10.3, 10.4

innodb_flush_method was removed in 11.4 and there is a new way to configure this.

In 11.4.1 there is an equivalent of =O_DIRECT but not of =O_DIRECT_NO_FSYNC

For MariaDB the typical my.cnf files were:

my.cnf.cz11a_bee – uses innodb_flush_method=O_DIRECT_NO_FSYNC
my.cnf.cz11a1_bee – like my.cnf.cz11a_bee but reduces the sizes of the buffer pool and redo log so that both fit in memory
my.cnf.cz11b_bee – uses innodb_flush_method=O_DIRECT
my.cnf.cz11b1_bee – like my.cnf.cz11b_bee but reduces the sizes of the buffer pool and redo log so that both fit in memory
my.cnf.cz11c_bee – uses innodb_flush_method=fsync
my.cnf.cz11d_bee – uses innodb_flush_method=O_DIRECT_NO_FSYNC and enables the InnoDB change buffer
my.cnf.cz11d1_bee – like my.cnf.cz11d_bee but reduces the sizes of the buffer pool and redo log so that both fit in memory

For MySQL the type my.cnf files were:

my.cnf.cz11a_bee – uses innodb_flush_method=O_DIRECT_NO_FSYNC
my.cnf.cz11b_bee – uses innodb_flush_method=O_DIRECT
my.cnf.cz11c_bee – uses innodb_flush_method=fsync
my.cnf.cz11d_bee – uses innodb_flush_method=O_DIRECT_NO_FSYNC and disables the InnoDB change buffer. Note the my.cnf.cz11[abc]_bee configs for MySQL enabled it. This is the opposite of what is done for MariaDB.

The Benchmark

The benchmark is run with 1 client. It is explained here and was run in two setups with a cached workload. The initial load (l.i0) inserts 30M rows and the database fits in memory. I recently updated the Insert Benchmark to change how deletes are done and the new delete SQL is here.

The test server was named v4 here and is a Beelink SER4. It has 8 cores, 16G RAM, Ubuntu 22.04 and XFS using 1 m.2 device.

The benchmark steps are:

l.i0

insert 30 million rows per table in PK order. The table has a PK index but no secondary indexes. There is one connection per client.

create 3 secondary indexes per table. There is one connection per client.

l.i1

use 2 connections/client. One inserts 40M rows and the other does deletes at the same rate as the inserts. Each transaction modifies 50 rows (big transactions). This step is run for a fixed number of inserts, so the run time varies depending on the insert rate.

l.i2

like l.i1 but each transaction modifies 5 rows (small transactions) and 10M rows are inserted and deleted.
Wait for X seconds after the step finishes to reduce variance during the read-write benchmark steps that follow. The value of X is a function of the table size.

qr100

use 3 connections/client. One does range queries and performance is reported for this. The second does does 100 inserts/s and the third does 100 deletes/s. The second and third are less busy than the first. The range queries use covering secondary indexes. This step is run for 1800 seconds. If the target insert rate is not sustained then that is considered to be an SLA failure. If the target insert rate is sustained then the step does the same number of inserts for all systems tested.

qp100

like qr100 except uses point queries on the PK index

qr500

like qr100 but the insert and delete rates are increased from 100/s to 500/s

qp500

like qp100 but the insert and delete rates are increased from 100/s to 500/s

qr1000

like qr100 but the insert and delete rates are increased from 100/s to 1000/s

qp1000

like qp100 but the insert and delete rates are increased from 100/s to 1000/s

Results

The performance reports are here for:

MariaDB 10.11, MariaDB 11.4, MySQL 8.0 release+config variations
most MariaDB release+config variations
MariaDB 10.11 release+config variations
MariaDB 11.4 release+config variations
most MySQL release+config variations
all DBMS

insert/s for l.i0, l.i1, l.i2
indexed rows/s for l.x
range queries/s for qr100, qr500, qr1000
point queries/s for qp100, qp500, qp1000

Below I use colors to highlight the relative QPS values with red for <= 0.95, green for >= 1.05 and grey for values between 0.95 and 1.05.

Results: MariaDB vs MySQL

Modern MariaDB (11.4.1) is faster than modern MySQL (8.0.36) on all benchmark steps except for qr* (range query) and l.x (create index) where they have similar performance.

From the summary:

The base case is MariaDB 10.11.7 with the cz11a_bee config (ma101107_rel.cz11a_bee). It is compared with

MariaDB 11.4.1 with the cz11b_bee config (ma110401_rel.cz11b_bee)
MySQL 8.0.36 with the cz11a_bee config (my8036_rel.cz11a_bee)

Relative throughput per benchmark step

l.i0

relative QPS is 0.98 in MariaDB 11.4.1
relative QPS is 0.68 in MySQL 8.0.36

l.x – I ignore this for now
l.i1, l.i2

relative QPS is 0.93, 0.97 in MariaDB 11.4.1
relative QPS is 0.88, 0.78 in MySQL 8.0.36

qr100, qr500, qr1000

relative QPS is 1.03, 1.02, 1.01 in MariaDB 11.4.1
relative QPS is 1.02, 1.04, 1.04 in MySQL 8.0.36

qp100, qp500, qp1000

relative QPS is 1.00, 1.00, 1.00 in MariaDB 11.4.1
relative QPS is 0.76, 0.74, 0.75 in MySQL 8.0.36

Results: MariaDB

Modern MariaDB (11.4.1) was at most 15% slower than older MariaDB (10.2.44). It is nice to see that MariaDB has done a great job of avoiding performance regressions over time.

From the summary:

The base case is MariaDB 10.2.44 with the cz11a_bee config (ma100244_rel.cz11a_bee). It is compared with more recent LTS releases from 10.3, 10.4, 10.5, 10.6, 10.11 and 11.4.
Throughput per benchmark step for 11.4.1 relative to 10.2.44

l.i0

relative QPS is 0.86 in MariaDB 11.4.1

l.x – I ignore this for now
l.i1, l.i2

relative QPS is 1.08, 1.07 in MariaDB 11.4.1

qr100, qr500, qr1000

relative QPS is 0.89, 0.86, 0.85 in MariaDB 11.4.1

qp100, qp500, qp1000

relative QPS is 0.91, 0.91, 0.88 in MariaDB 11.4.1

Results: MariaDB 10.11

From the summary:

The base case is MariaDB 10.11.7 with the cz11a_bee config (ma101107_rel.cz11a_bee) and 10.11.7 releases with other configs are compared to it.
Performance does not change with the cz11a1_bee config that uses a smaller size for the InnoDB redo log and buffer pool so that they fit in memory. In workloads that are more IO-bound I see that MariaDB 10.11+ does read IO for the redo log while MySQL does not. Shrinking the buffer pool and redo log isn’t good for performance but avoids the read IO.
Performance is worse with the cz11b_bee config on the random write (l.i1, l.i2) benchmark steps. This indicates that setting innodb_flush_method to O_DIRECT_NO_FSYNC is better than setting it to O_DIRECT.
Performance is worse with the cz11c_bee config on the random write (l.i1, l.i2) benchmark steps. This indicates that setting innodb_flush_method to O_DIRECT_NO_FSYNC is better than setting it to fsync.
Performance does not change with the cz11d_bee config which enabled the InnoDB change buffer. Given that the working set fits in the InnoDB buffer pool the result is expected.
Performance is worse with the cz11d1_bee config on the random write (l.i1, l.i2) benchmark steps. See the comments above for cz11a1_bee.

Results: MariaDB 11.4

From the summary:

The base case is MariaDB 11.4.1 with the cz11b_bee config (ma110401.cz11b_bee) and 11.4.1 releases with other configs are compared to it. Note that 11.4.1 does not support the equivalent of O_DIRECT_NO_FSYNC for innodb_flush_method.
Performance is worse with the cz11b1_bee config on the random write (l.i1, l.i2) benchmark steps. See the comments in the previous section on the cz11a1_bee config.
Performance is slightly worse with the cz11c_bee config on the random write (l.i1, l.i2) benchmark steps. This indicates that setting innodb_flush_method to O_DIRECT_NO_FSYNC is better than setting it to fsync.

Results: MySQL

There are significant performance regressions from MySQL 5.6 to 8.0.

From the summary:

The base case is MySQL 5.6.51 with the cz11a_bee config (my5651_rel.cz11a_bee) and it is compared to MySQL 5.7.44 and 8.0.36.
Relative throughput per benchmark step

l.i0

relative QPS is 0.84 in MariaDB 11.4.1
relative QPS is 0.57 in MySQL 8.0.36

l.x – I ignore this for now
l.i1, l.i2

relative QPS is 1.17, 0.89 in MariaDB 11.4.1
relative QPS is 1.07, 0.73 in MySQL 8.0.36

qr100, qr500, qr1000

relative QPS is 0.73, 0.72, 0.71 in MariaDB 11.4.1
relative QPS is 0.63, 0.63, 0.63 in MySQL 8.0.36

qp100, qp500, qp1000

relative QPS is 0.82, 0.79, 0.80 in MariaDB 11.4.1
relative QPS is 0.62, 0.60, 0.62 in MySQL 8.0.36

The post The Insert Benchmark: MariaDB, MySQL, small server, cached workload appeared first on MariaDB.org.

Understanding the Unnest Function in PostgreSQL: Transforming Arrays into Rows

2024-04-13T12:51:03+00:00

In PostgreSQL, the unnest() function is a powerful tool that serves the purpose of expanding arrays into a set of rows, effectively transforming array elements into individual row entries in a result set. Conceptually, this is akin to “flattening” the array, where each element of the array is transformed from being a part of a collective array structure into a standalone row in the database output. This is particularly useful in scenarios where data stored as arrays needs to be manipulated or analyzed using standard SQL operations, which typically operate on row-based data.

The utility of unnest() extends beyond simple arrays to include scenarios involving multiple arrays or even arrays of composite types. When multiple arrays are unnested in parallel, each array is expanded simultaneously, with corresponding elements forming a row together. This allows for the alignment of related data from different arrays, which is especially useful in analyses involving multiple linked data points, such as time-series data or paired measurements.

Furthermore, unnest() can handle arrays containing composite types, thus enabling complex data structures stored within arrays to be decomposed into simpler, query-able rows. This capability is crucial when dealing with multidimensional data that has been compactly stored in PostgreSQL’s advanced data types. By leveraging unnest(), users can effectively bridge the gap between the non-relational nature of array data and the relational paradigm of SQL, enhancing both the flexibility and power of database operations in PostgreSQL.

To learn more about the PostgreSQL Unnest function, Please download the whitepaper here.

How to query a REST API with MariaDB CONNECT engine

2024-04-12T23:52:01+00:00

Querying a remote REST API directly from the database: does it sound like pure madness? Some people will surely answer that yes, it does. So, first of all, let me spend a paragraph to show some great benefits of doing so. Then I’ll show you how to query a REST API with MariaDB CONNECT storage engine. I will explore a practical use case with Hashicorp Releases API.

The benefits of querying a REST API in SQL

The most obvious use case is for data analysts, data scientists or product managers. Some of them can’t program at all, but chances are that they know SQL and use it in their job. So they can’t write a script to gather the data they need from an API. But if they can just run an SQL query, their problem is solved. In no time, without involving anyone else.

Now to data engineers, developers and DevOps. Sometimes applications read data from a remote API. This means that, for every used API, some developers will have to write some code. When a very similar result can be achieved by writing a CREATE TABLE statement, I call it a productivity problem.

Another problem is about optimising a process. The typical approach is that an application retrieves some data from an API call, transforms them, and writes them into a database. Many things can go wrong, corrupting data or preventing an update. The process might be too slow, because data have to travel twice and the application has to process them. And sending some types of data to/from the application might increase security risks if your data is sensitive.

MariaDB CONNECT allows us to interact with a huge variety of data sources. This includes other databases, data files in many formats, and REST APIs. Every time you send a SELECT to a CONNECT table backed by a REST API, data will be retrieved via HTTP or HTTPS, and will be presented to the client in the form of a normal resultset.

This is obviously slower than querying local data. And it’s theoretically slower than querying an API directly from an application. But often the purpose is to populate a table with remote data. This table can be a regular InnoDB table, which will answer queries quickly. And the application can launch an ALTER TABLE ... ENGINE or INSERT ... SELECT query to move the data directly from the remote API to the local database. This will be showed later in this article.

There are also cases when the performance difference doesn’t matter. In the case of a data analyst, it should be ok to wait for more time to run a query, as long as running a query is all they need to do to read data from the API.

Querying the Hashicop Releases API in SQL

Let’s use progressively more complex examples so explore interesting CONNECT features.

The examples are based on the Hashicorp Release API, that can be used to find out whether HashiCorp software needs to be upgraded.

Exmple 1: Querying an array of strings

Let’s start by creating a table that contains very simple results from a REST API call. It returns an array of strings. In MariaDB, we’ll want to see it as a one-column table, where each string is a row.

CREATE OR REPLACE TABLE hashicorp_product (
    name VARCHAR(100) NOT NULL
)
    ENGINE = CONNECT,
    TABLE_TYPE = JSON,
    HTTP = 'https://api.releases.hashicorp.com/v1/products'
;

ENGINE: We’re specifying that the table must use the CONNECT engine, rather than the default engine (normally InnoDB).
TABLE_TYPE: CONNECT can handle a big range of data sources and formats, so we need to specify which one needs to be used in this case: JSON.
HTTP: For JSON tables, we can specify a URL. An HTTP call with the GET method, is expected to return the data we want to query. HTTPS is supported, as showed in the example below. Querystrings are not used in our examples, but they are supported, too.

Example 2: Array of objects, using table discovery

The most common case is REST APIs returning an array of objects. In MariaDB, we want to see every object as a row.

Objects are key/value data structures where each property is a column. Therefore CONNECT is able to create a table even if we don’t specify the columns we want! Each property name will be used as a column name, and it will infer the right types to use. This functionality is called table discovery, and there some other storage engines are able to use it, too.

CREATE OR REPLACE TABLE consul_release
    ENGINE = CONNECT,
    TABLE_TYPE = JSON,
    HTTP = 'https://api.releases.hashicorp.com/v1/releases/consul'
;

Now, let’s see the resulting table structure:

> SHOW CREATE TABLE consul_release G
*************************** 1. row ***************************
       Table: consul_release
Create Table: CREATE TABLE `consul_release` (
  `builds_arch` char(5) NOT NULL `JPATH`='$.builds[0].arch',
  `builds_os` char(6) NOT NULL `JPATH`='$.builds[0].os',
  `builds_url` char(100) NOT NULL `JPATH`='$.builds[0].url',
  `is_prerelease` tinyint(1) NOT NULL `JPATH`='$.is_prerelease',
  `license_class` char(10) NOT NULL `JPATH`='$.license_class',
  `name` char(6) NOT NULL `JPATH`='$.name',
  `status_state` char(9) NOT NULL `JPATH`='$.status.state',
  `status_timestamp_updated` char(24) NOT NULL `JPATH`='$.status.timestamp_updated',
  `timestamp_created` char(24) NOT NULL `JPATH`='$.timestamp_created',
  `timestamp_updated` char(24) NOT NULL `JPATH`='$.timestamp_updated',
  `url_changelog` char(79) NOT NULL `JPATH`='$.url_changelog',
  `url_docker_registry_dockerhub` char(52) NOT NULL `JPATH`='$.url_docker_registry_dockerhub',
  `url_docker_registry_ecr` char(51) NOT NULL `JPATH`='$.url_docker_registry_ecr',
  `url_license` char(53) NOT NULL `JPATH`='$.url_license',
  `url_project_website` char(37) NOT NULL `JPATH`='$.url_project_website',
  `url_release_notes` char(40) NOT NULL `JPATH`='$.url_release_notes',
  `url_shasums` char(95) NOT NULL `JPATH`='$.url_shasums',
  `url_shasums_signatures` char(99) NOT NULL `JPATH`='$.url_shasums_signatures[0]',
  `url_source_repository` char(35) DEFAULT NULL `JPATH`='$.url_source_repository',
  `version` char(19) NOT NULL `JPATH`='$.version'
) ENGINE=CONNECT DEFAULT CHARSET=utf8mb4 COLLATE=utf8mb4_uca1400_ai_ci `TABLE_TYPE`='JSON' `HTTP`='https://api.releases.hashicorp.com/v1/releases/consul'

Interesting, isn’t it? Let’s look at the first column, as an example:

`builds_arch` char(5) NOT NULL `JPATH`='$.builds[0].arch',

We can see that every row-object contains a nested array called builds that contains objects with these properties: arch, os, url. By default, the related columns are colled builds_arch, builds_os and builds_url, and they contain the values from the first element of the builds array.

CONNECT uses a purpose-built language called JPath to specify this. If you know JSONPath you won’t find JPATH hard to understand.

Example 3: Array of objects, specifying columns manually

Let’s use the above example again. But suppose that the table we want to build is different from the one created by default by MariaDB. We can run a statement like this:

CREATE OR REPLACE TABLE consul_release (
    version VARCHAR(100) NOT NULL,
    not_stable VARCHAR(100) NOT NULL
        FIELD_FORMAT = 'is_prerelease',
    state VARCHAR(100) NOT NULL
        FIELD_FORMAT = 'status.state',
    timestamp_created TIMESTAMP NOT NULL
)
    ENGINE = CONNECT,
    TABLE_TYPE = JSON,
    HTTP = 'https://api.releases.hashicorp.com/v1/releases/consul'
;

Now, here’s what we did:

We only specified a few columns. Some JSON properties will simply be ignored and won’t be part of the table.
The version and timestamp_created columns have the same name as the matching properties. So there is no need to specify where these values should come from.
The not_stable column has the same value as the is_prerelease property, that is not nested.
The state column has the same values as the state property, nested in an object property called status.

Let’s query the table:

> SELECT * FROM consul_release;
+---------------------+------------+-----------+---------------------+
| version             | not_stable | state     | timestamp_created   |
+---------------------+------------+-----------+---------------------+
| 1.16.5+ent.fips1402 | false      | supported | 2024-01-23 20:40:10 |
| 1.16.5+ent          | false      | supported | 2024-01-23 20:40:05 |
| 1.16.5              | false      | supported | 2024-01-23 20:31:17 |
| 1.15.9+ent          | false      | supported | 2024-01-23 20:28:28 |
| 1.17.2+ent.fips1402 | false      | supported | 2024-01-23 20:26:59 |
| 1.17.2+ent          | false      | supported | 2024-01-23 20:26:53 |
| 1.15.9              | false      | supported | 2024-01-23 20:23:38 |
| 1.17.2              | false      | supported | 2024-01-23 20:23:21 |
| 1.17.1+ent.fips1402 | false      | supported | 2023-12-14 23:16:04 |
| 1.17.1+ent          | false      | supported | 2023-12-14 23:15:57 |
+---------------------+------------+-----------+---------------------+

Note that only one status nested object exists (it’s not in an array).

Example 4: Expanding a nested array

Now let’s discuss a different case. As mentioned before, the builds property is an array. So, for every table row, an array exists with multiple values.

Suppose we want to expand it, and change the logic: we want to have a row for each builds element. Values that are outside that array will be repeated multiple times.

CREATE OR REPLACE TABLE consul_build (
    version VARCHAR(100) NOT NULL,
    arch VARCHAR(100) NOT NULL
        FIELD_FORMAT = 'builds.[*].arch',
    os VARCHAR(100) NOT NULL
        FIELD_FORMAT = 'builds[*].os',
    timestamp_created TIMESTAMP NOT NULL
)
    ENGINE = CONNECT,
    TABLE_TYPE = JSON,
    HTTP = 'https://api.releases.hashicorp.com/v1/releases/consul'
;

As you can see, we specified builds.[*].arch. This is a JPATH syntax which wouldn’t work in JSONPath. And it means that the builds array must be expanded. Here’s a query example:

> SELECT * FROM consul_build
  ORDER BY LEFT(version, 6) DESC
  FETCH FIRST 10 ROWS WITH TIES;
+---------------------+-------+---------+---------------------+
| version             | arch  | os      | timestamp_created   |
+---------------------+-------+---------+---------------------+
| 1.17.2+ent.fips1402 | amd64 | linux   | 2024-01-23 20:26:59 |
| 1.17.2+ent.fips1402 | arm64 | linux   | 2024-01-23 20:26:59 |
| 1.17.2+ent.fips1402 | amd64 | windows | 2024-01-23 20:26:59 |
| 1.17.2+ent          | amd64 | darwin  | 2024-01-23 20:26:53 |
| 1.17.2+ent          | arm64 | darwin  | 2024-01-23 20:26:53 |
| 1.17.2+ent          | 386   | freebsd | 2024-01-23 20:26:53 |
| 1.17.2+ent          | amd64 | freebsd | 2024-01-23 20:26:53 |
| 1.17.2+ent          | 386   | linux   | 2024-01-23 20:26:53 |
| 1.17.2+ent          | amd64 | linux   | 2024-01-23 20:26:53 |
| 1.17.2+ent          | arm   | linux   | 2024-01-23 20:26:53 |
| 1.17.2+ent          | arm64 | linux   | 2024-01-23 20:26:53 |
| 1.17.2+ent          | s390x | linux   | 2024-01-23 20:26:53 |
| 1.17.2+ent          | amd64 | solaris | 2024-01-23 20:26:53 |
| 1.17.2+ent          | 386   | windows | 2024-01-23 20:26:53 |
| 1.17.2+ent          | amd64 | windows | 2024-01-23 20:26:53 |
| 1.17.2              | amd64 | darwin  | 2024-01-23 20:23:21 |
| 1.17.2              | arm64 | darwin  | 2024-01-23 20:23:21 |
| 1.17.2              | 386   | freebsd | 2024-01-23 20:23:21 |
| 1.17.2              | amd64 | freebsd | 2024-01-23 20:23:21 |
| 1.17.2              | 386   | linux   | 2024-01-23 20:23:21 |
| 1.17.2              | amd64 | linux   | 2024-01-23 20:23:21 |
| 1.17.2              | arm   | linux   | 2024-01-23 20:23:21 |
| 1.17.2              | arm64 | linux   | 2024-01-23 20:23:21 |
| 1.17.2              | amd64 | solaris | 2024-01-23 20:23:21 |
| 1.17.2              | 386   | windows | 2024-01-23 20:23:21 |
| 1.17.2              | amd64 | windows | 2024-01-23 20:23:21 |
+---------------------+-------+---------+---------------------+

If you don’t understand the WITH TIES syntax, see MariaDB: WITH TIES syntax.

As you can see, a row was returned for every element of builds. Builds of the same release share the same values for version and timestamp_created.

Example 5: Transforming values

Sometimes the values returned by a remote API are not convenient for our applications. In this case, we may want to process and transform some values. This is something we can do with some MariaDB features that are not specific to CONNECT.

Let’s see an example first:

CREATE OR REPLACE TABLE consul_release (
    version VARCHAR(100) NOT NULL,
    state VARCHAR(100) NOT NULL
        FIELD_FORMAT = 'status.state',
    is_prerelease BOOL NOT NULL DEFAULT 0 INVISIBLE,
    is_stable BOOL GENERATED ALWAYS AS (NOT is_prerelease) VIRTUAL,
    timestamp_created TIMESTAMP NOT NULL
)
    ENGINE = CONNECT,
    TABLE_TYPE = JSON,
    HTTP = 'https://api.releases.hashicorp.com/v1/releases/consul'
;

is_prerelease is the column that contains the original value returned by the API. It is defined as an invisible column.
is_prerelase has a default value. This is required for invisible columns, but it has no effect for CONNECT JSON tables.
is_stable is a generated column, which contains the result off an expression (NOT is_prerelease).

SELECT * FROM consul_release will return the is_stable column, but not the is_prerelease column. It will still be possible to require this column explicitly by running:

SELECT *, is_prerelease FROM conssul_release;

Example 6: Querying a details API method with arguments

Many REST APIs have two types of GET methods:

To retrieve an array of objects (list);
To retrieve a single object (details).

Above we used two methods of the first type. Now we’re approaching a method of the second type.

And we have a problem: we have a single parametrised URL to call for any object we want to see. But for CONNECT there’s no such thing as parameters in URLs, so every API call should be handled by a different table. And we don’t want to have a CONNECT table for every release of every HashiCorp product.

What can we do about it? Well, at a low level we’ll actually have those tables. But we can hide the mechanism that creates and queries the table inside a stored procedure.

DELIMITER ||
CREATE OR REPLACE PROCEDURE select_consul_release(IN i_release VARCHAR(100))
    NOT DETERMINISTIC
    MODIFIES SQL DATA
BEGIN
    DECLARE v_sql TEXT;
    DECLARE v_table_name VARCHAR(64) DEFAULT CONCAT('consul_release_', CONNECTION_ID());
    SET v_sql := SFORMAT('
    CREATE OR REPLACE TABLE {} (
        version VARCHAR(100) NOT NULL,
        state VARCHAR(100) NOT NULL
            FIELD_FORMAT = ''status.state'',
        is_prerelease BOOL NOT NULL DEFAULT 0 INVISIBLE,
        is_stable BOOL GENERATED ALWAYS AS (NOT is_prerelease) VIRTUAL,
        timestamp_created TIMESTAMP NOT NULL
    )
        ENGINE = CONNECT,
        TABLE_TYPE = JSON,
        HTTP = ''https://api.releases.hashicorp.com/v1/releases/consul/{}''
    ;
    ', v_table_name, i_release);
    EXECUTE IMMEDIATE v_sql;

    EXECUTE IMMEDIATE CONCAT('SELECT * FROM ', v_table_name, ';');
    EXECUTE IMMEDIATE CONCAT('DROP TABLE ', v_table_name, ';');
END;
||
DELIMITER ;

This procedure creates a CONNECT table from a REST URL that contains the desired argument (the release name). Then it runs a SELECT to return its contents, and destroys the table. If multiple connections are likely to use this procedure concurrently, they won’t interfere with each other because the table name contains the current connection id.

Alternatively, you might want multiple connections to share existing CONNECT tables. To do so, use CREATE TABLE IF NOT EXISTS (rather than OR REPLACE) and use the release name as part of the table name (rather than the connection id).

Proof that it works:

> CALL select_consul_release('1.17.2');
+---------+-----------+-----------+---------------------+
| version | state     | is_stable | timestamp_created   |
+---------+-----------+-----------+---------------------+
| 1.17.2  | supported |         1 | 2024-01-23 20:23:21 |
+---------+-----------+-----------+---------------------+

These procedures won’t work in MySQL because they use the unique MariaDB features:

The CONNECT engine, obviously;
CREATE OR REPLACE TABLE, which is similar to DROP TABLE + CREATE TABLE, but it’s atomic and less verbose;
SFORMAT(), to compose strings by interpolation (see How to compose strings in MariaDB);
EXECUTE IMMEDIATE, which has some limitations but can normally be used as a shortcut for PREPARE + EXECUTE + DEALLOCATE PREPARE.

Materialising a CONNECT table

We mentioned before that a CONNECT table can be materialised. This means that it becomes a regular table, which normally uses the InnoDB storage engine, and the remote data is written to the local database.

One off materialisation

The simplest and fastest way to materialise a CONNECT table is the following:

ALTER TABLE table_name ENGINE = InnoDB;
RENAME TABLE table_name TO new_name;

I don’t like to give numbers that might depend on a lot of factors, but in most cases it will take less than 5 seconds even for tables that contain a few millions of rows.

Incremental materialisation

Your logic might be more complex than this. Maybe you want to incrementally import data from a remote API every night. This means that you can’t create a new table every time.

The first time you import data you will need to create the table with the current data. You can use the method I have shown above, for a one off materialisation.

Incremental imports can be done in this way:

INSERT INTO materialised_table
    (column1, column2, column3)
    SELECT column1, column2, column3 FROM connect_table
;

JSON file

I left this method as the last one because it’s rarely useful. But you might want to know about it if you need a JSON file, and anyway it tells us something about how CONNECT works internally.

You might have noticed that all the above CONNECT tables use the JSON TABLE_TYPE. This table type was originally designed to use JSON files as relational tables. To do this, one has to specify a file name:

CREATE TABLE table_name
...
ENGINE = CONNECT,
TABLE_TYPE = JSON,
FILE_NAME = 'filename.json';

If the file exists in the data directory, it will be used. If not, it will be created.

But even if we didn’t specify a file name, one was created anyway. If you ran the CREATE TABLEs above, you probably noticed warnings like this:

*************************** 1. row ***************************
  Level: Warning
   Code: 1105
Message: No file name. Table will use hashicorp_product.json

And if you look in the data directory, in the database subdirectory, we’ll see this:

# ls -l /var/lib/mysql/test/hashicorp_product.*
-rw-rw---- 1 mysql mysql 907 Jan 27 14:23 /var/lib/mysql/test/hashicorp_product.frm
-rw-rw---- 1 mysql mysql   0 Jan 27 14:23 /var/lib/mysql/test/hashicorp_product.json

Initially the file is empty. After we run a query, it fill be filled with the output received from the REST API, regardless any WHERE in our query:

# cat hashicorp_product.json 
["athena-cli","atlas-upload-cli",...]

So when we query a CONNECT JSON table, CONNECT will check the HTTP table option. If it was specified, CONNECT will call that URL and populate (or overwrite) the JSON file. The rest of the operations are performed as if it was a regular file-based CONNECT table.

Limitations

CONNECT JSON tables have some important limitations:

Only the GET method is supported. This is theoretically correct because only GET should be used to read data, but in practice some APIs might require a different method.
Some APIs require an additional call for authorisation. CONNECT has no way to make an additional call.
No way to specify an HTTP(S) header to add to the requests. This is often required for authorisation (you call an authorisation method, you receive back a token valid for a period of time, and you add that token to the next requests, in the Autorization HTTP header). But these features are independent: tokens are not always passed in an HTTP header, and HTTP headers have other uses too.
There is no way to see the response HTTP code, nor any other response headers (useful for redirects and errors).
If a token or an API key is written in the URL, it will be readable with the SHOW CREATE TABLE command or by querying the information_schema.TABLES table.
Every URL is unique, even if it contains parameters. We can’t use the WHERE clause to modify the actual URL that is called.
Some APIs have a usage limit. For example they might accept no more than 10 calls per hour, or 100 per day. While the contents of an API can be materialised as discussed above, it would be very handy if CONNECT could cache results of an API call for a period of time.

Some of these limitations can be removed or lifted by implementing the necessary features as stored procedures. I will demonstrate this in a future article.

Conclusions

The CONNECT storage engine allows easy data integration between MariaDB and other technologies. In this article we saw how to use CONNECT to query a REST API. We discussed several examples, progressively more complex, which covers the majority of the problems that you might have when working with CONNECT and a REST API.

Should you have questions, or problems with CONNECT, don’t hesitate to contact us for a consultation.

Federico Razzoli

The post How to query a REST API with MariaDB CONNECT engine appeared first on MariaDB.org.

Partially Rolling Back a Transaction in MySQL or PostgreSQL

2024-04-12T13:36:18+00:00

This short write-up focuses on a different transaction control behavior of databases. Though this is not unusual, I decided to write an article on rolling back transactions to a particular point. I selected this topic because I found many people are not aware of this feature in databases.DescriptionEvery ACID-compliant RDBMS follows the “All or None” […]

The post Partially Rolling Back a Transaction in MySQL or PostgreSQL appeared first on MariaDB.org.

Sysbench on a (less) small server: MariaDB and MySQL

2024-04-12T00:40:00+00:00

This has results from the sysbench benchmark for MariaDB and MySQL on a (less) small server with a cached and low-concurrency workload. For MariaDB I tested LTS releases from 10.2 through 11.4. For MySQL I tested 5.6, 5.7 and 8.0. The results from MySQL here are a good reason to use changepoint detection to spot regressions early, like that provided by Nyrkiö.

This work was done by Small Datum LLC and sponsored by the MariaDB Foundation. A previous post shared results from my smallest and oldest server. This has results from two newer and faster, but still small, servers. Regardless, the results don’t change.

My standard disclaimer is that sysbench with low-concurrency is great for spotting CPU regressions. However, a result with higher concurrency from a larger server is also needed to understand things. Results from IO-bound workloads and less synthetic workloads are also needed. But low-concurrency, cached sysbench is a great place to start.

tl;dr

MariaDB is great at avoiding CPU regressions over time
MySQL is less than great at avoiding CPU regressions over time
Modern MariaDB is 13% to 36% faster than modern MySQL (on this setup)
Enabling the InnoDB change buffer does not improve results here. It might help for IO-bound sysbench.
Modern MariaDB does reads from storage for the redo log (read-modify-write) while MySQL does not. I know the MariaDB redo log architecture has changed from MySQL — MariaDB uses just one large redo log file. But I can’t fully explain this.

Builds and configuration

For MariaDB I used the latest point releases from LTS versions: 10.2.44, 10.3.39, 10.4.33, 10.5.24, 10.6.17, 10.11.7 and the upcoming LTS 11.4.1.

For MySQL I used 5.6.51, 5.7.44 and 8.0.36.

The first GA release for MariaDB 10.2 was in 2017. The first GA release for MySQL 5.6 was in 2013. So while I cannot claim that my testing covers MySQL and MariaDB from the same time period, I can claim that I am testing old versions of both.

Everything was compiled from source with similar CMake command lines and CMAKE_BUILD_TYPE set to Release. It is much easier to compile older MariaDB releases than older MySQL releases. For MariaDB I did not have to edit any source files. Notes on compiling MySQL are here for 5.6, for 5.6 and 5.7, for 5.7 and for 8.0. A note on using cmake is here.

The my.cnf files are here for the ser4 and the ser7 servers. Note that the ser7 server has a faster CPU and twice as much RAM.

Benchmarks

I used sysbench and my usage is explained here. There are 42 microbenchmarks and each tests ~1 type of SQL statement.

Tests were run on two variants of the small servers I have at home. Each has 8 cores and NVMe SSD with XFS and Ubuntu 22.04. The servers are described here.

ser4 – this is the v4 server here with 16G of RAM and has a Ryzen 7 4700U CPU. It is a Beelink SER 4700U so I call it the ser4 (or SER4).
ser7 – this is the v5 server here with 32G of RAM and has a Ryzen 7 7840HS CPU. It is a Beelink SER7 so I call it the ser7 (or SER7).

The benchmark is run with:

one connection
30M rows and a database cached by InnoDB
each microbenchmark runs for 300 seconds if read-only and 600 seconds otherwise
prepared statements were enabled

The command line was: bash r.sh 1 30000000 300 600 nvme0n1 1 1 1

Results

For the results below I split the 42 microbenchmarks into 5 groups — 2 for point queries, 2 for range queries, 1 for writes. For the range query microbenchmarks, part 1 has queries that don’t do aggregation while part 2 has queries that do aggregation. The spreadsheet with all data and charts are here for ser4 and for ser7.

All of the charts have relative throughput on the y-axis where that is (QPS for $me) / (QPS for $base), $me is some DBMS version (for example MariaDB 11.4.1) and $base is the DBMS version for the base case. The base version is specified below depending on what I am comparing. The y-axis doesn’t start at 0 to improve readability. When the relative throughput is > 1 then the throughput on some DBMS version is greater than the throughput for the base case.

The legend under the x-axis truncates the names I use for the microbenchmarks and I don’t know how to fix that other than sharing links (see above) to the Google Sheets I used.

Results: MariaDB from old to new

This section uses MariaDB 10.2.44 as the base version and then compares that with MariaDB versions 10.3.39, 10.4.33, 10.5.24, 10.6.17, 10.11.7 and 11.4.1. The goal is to determine how throughput (QPS) changes from older releases like 10.2 to the latest release (11.4).

These tables have summary statistics from ser4 and ser7 of the relative throughput for MariaDB 11.4.1 vs 10.2.44 for each of the microbenchmark groups. A value greater than one means the throughput for MariaDB 11.4.1 is better than for 10.2.44. From the results here, new MariaDB (11.4.1) gets at least 91% with ser4 and 93% with ser7 of the throughput relative to old MariaDB (10.2.44) using the median relative throughput per microbenchmark group. New features added to MariaDB don’t get in the way of performance because there aren’t significant regressions over time.

from ser4	point, part 1	point, part 2	range, part 1	range, part 2	writes
average	0.96	0.95	0.99	0.96	0.91
median	0.96	0.97	0.99	0.94	0.91
min	0.90	0.91	0.93	0.88	0.77
max	0.98	0.98	1.11	1.07	1.03
stddev	0.02	0.03	0.05	0.09	0.07

from ser7	point, part 1	point, part 2	range, part 1	range, part 2	writes
average	0.94	0.94	0.95	0.98	0.92
median	0.96	0.96	0.93	0.96	0.95
min	0.87	0.88	0.90	0.93	0.76
max	0.97	0.99	1.10	1.04	1.02
stddev	0.03	0.05	0.06	0.04	0.08

There are two graphs per microbenchmark group – first for ser4 and second for ser7. The y-axis doesn’t begin at zero to improve readability. The y-axis starts at 0.75 for ser4 and 0.60 for ser7.

Graphs for point queries, part 2.

Graphs for range queries, part 1.

Graphs for range queries, part 2.

Graphs for writes.

Results: MySQL from old to new

This section uses MySQL 5.6.51 as the base version and then compares that with MySQL versions 5.7.44 and 8.0.36. The goal is to determine how throughput (QPS) changes from older to newer releases.

These tables have summary statistics from ser4 and ser7 of the relative throughput for MySQL 8.0.36 vs 5.6.51 for each of the microbenchmark groups. A value greater than one means the throughput for MySQL 8.0.36 is better than for 5.6.51. From the results here, new MySQL (8.0.36) gets between 56% and 89% of the throughput relative to old MySQL (5.6.51) using the median relative throughput per microbenchmark group. New features in modern MySQL come at the cost of much CPU overhead.

from ser4	point, part 1	point, part 2	range, part 1	range, part 2	writes
average	0.72	0.71	0.66	0.81	0.61
median	0.70	0.70	0.65	0.76	0.56
min	0.67	0.61	0.63	0.66	0.44
max	0.78	0.82	0.72	1.00	0.99
stddev	0.04	0.09	0.03	0.15	0.16

from ser7	point, part 1	point, part 2	range, part 1	range, part 2	writes
average	0.77	0.77	0.72	0.92	0.71
median	0.80	0.75	0.73	0.89	0.67
min	0.65	0.69	0.62	0.80	0.51
max	0.84	0.86	0.77	1.07	1.10
stddev	0.05	0.07	0.05	0.12	0.18

There are two graphs per microbenchmark group – first for ser4 and second for ser7. The y-axis doesn’t begin at zero to improve readability. The y-axis starts at 0.40 for ser4 and 0.50 for ser7.

Graphs for point queries, part 2.

Graphs for range queries, part 1.

Graphs for range queries, part 2.

Graphs for writes.

Results: MariaDB vs MySQL

This section uses MySQL 8.0.36 as the base version and then compares that with MariaDB 11.4.1. The goal is to determine which DBMS gets more throughput (or uses less CPU/query).

These tables have summary statistics from ser4 and ser7 of the relative throughput for MariaDB 11.4.1 vs MySQL 8.0.36 for each of the microbenchmark groups. A value greater than one means the throughput for MariaDB 11.4.1 is better than for MySQL 8.0.36. From the results here, modern MariaDB (11.4.1) gets between 113% and 136% of the throughput relative to modern MySQL (8.0.36) using the median relative throughput per microbenchmark group. Modern MariaDB is faster than modern MySQL (on this setup) because MySQL has more performance regressions over time.

from ser4	point, part 1	point, part 2	range, part 1	range, part 2	writes
average	1.17	1.21	1.21	1.19	1.31
median	1.15	1.22	1.21	1.17	1.36
min	1.12	1.19	1.15	1.16	1.11
max	1.30	1.24	1.37	1.25	1.48
stddev	0.05	0.02	0.07	0.04	0.12

from ser7	point, part 1	point, part 2	range, part 1	range, part 2	writes
average	1.16	1.20	1.18	1.17	1.23
median	1.13	1.20	1.15	1.15	1.28
min	1.10	1.16	1.07	1.10	0.96
max	1.32	1.24	1.40	1.26	1.41
stddev	0.06	0.03	0.10	0.07	0.14

There are two graphs per microbenchmark group – first for ser4 and second for ser7. The y-axis doesn’t begin at zero to improve readability. The y-axis starts at 1.00 for both ser4 and ser7.

Graphs for point queries, part 2.

Graphs for range queries, part 1.

Graphs for range queries, part 2.

Graphs for writes. The bar for update-index_range=1000 isn’t visible on the second graph because the relative throughput is 0.96 but the y-axis starts at 1.00.

Results: change buffer

The InnoDB change buffer was removed from MariaDB 11.4. That feature has been good to me during the Insert Benchmark so I repeated tests in MariaDB 10.11.7 with it enabled and disabled. For the sysbench microbenchmarks and this workload (low concurrency, cached database) there is no benefit from it. It might have a benefit for some sysbench microbenchmarks when the database isn’t cached.

This post is already too long so I won’t provide charts but the raw data is here for ser4 and for ser7.

Results: disk read mystery

The database is cached by InnoDB for these tests but the sum of the sizes of the InnoDB buffer pool and InnoDB redo logs are larger than memory. And that is a normal (best practice) configuration. Unfortunately, starting in MariaDB 10.11 there is read IO during some of the microbenchmark steps. All of the configs use either O_DIRECT or O_DIRECT_NO_FSYNC for InnoDB flush method and I confirmed that database file reads aren’t being done based on the output of SHOW ENGINE INNODB STATUS.

What might be the source of the reads? My first guess is read-modify-write (RMW) from the InnoDB redo log. The problem is that the redo log uses buffered IO (not O_DIRECT), redo log writes are done as a multiple of 512 bytes and writing to the first 512 bytes of a 4kb filesystem page will do a storage read if that 4kb page isn’t in the OS page cache. Note that innodb_log_write_ahead_size in MySQL 8 reduces the chance of this happening. It was my first guess because we suffered from it long ago with web-scale MySQL.

This happens on the ser4 server but not the ser7 server. The ser7 server has twice as much RAM but uses the same size tables, so there is more spare RAM and perhaps that makes it easier for Linux to cache the InnoDB redo log.

I ran additional tests where the InnoDB redo log and InnoDB data files use different storage devices I see that all of the read IO occurs on the storage device for the redo log so my guess was correct but why this occurs for modern MariaDB but not for MySQL remains a mystery.

I also looked at strace output from MariaDB and MySQL. I didn’t see anything different. Although the redo log architecture has changed in MariaDB. It now uses just one redo log file while MySQL uses as many as you configure and the my.cnf files I used are here for the ser4 and the ser7 servers.

The unexpected storage reads might not hurt performance here because I use innodb_flush_log_at_trx_commit =2 (no sync on commit) and user transactions might not have to wait for that read on commit.

I have a script that provides iostat and vmstat metrics normalized by throughput (operations/s) to show how much hardware is used per request. And the problem is visible here for several microbenchmark steps including update-inlist.range100.pk1 and update-index.range100,pk1. The columns in the output are:

cpu/o – CPU per operation (from vmstat us + sy)
cs/o – context switches per operation (from vmstat cs)
r/o – storage reads per operation (from iostat r/s)
rKB/o – storage KB read per operation (from iostat rKB/s)
wKB/o – storage KB written per operation (from iostat wKB/s)
o/s – throughput, QPS, operations/s

In the tables below the value for r/o is 0 prior to MariaDB 10.11.7 (ma101107_rel) and then is 0.077 for MariaDB 10.11.7 and 0.034 for MariaDB 11.4.1. While I don’t show it here, this problem also occurs for MySQL 5.6.51 but not for MySQL 5.7.44 or 8.0.36.

The data from the tables below is here for ser4 and for ser7.

sb.met.update-inlist.range100.pk1.dop1

— absolute

cpu/o cs/o r/o rKB/o wKB/o o/s dbms

0.006937 36.076 0 0 90.665 3966 x.ma100244_rel.z11a_bee.pk1

0.007467 35.345 0 0 89.534 3845 x.ma100339_rel.z11a_bee.pk1

0.007336 35.395 0 0 91.588 3722 x.ma100433_rel.z11a_bee.pk1

0.005755 37.579 0 0 7.408 4126 x.ma100524_rel.z11a_bee.pk1

0.004227 8.442 0 0 33.077 3890 x.ma100617_rel.z11a_bee.pk1

0.004253 8.595 0.077 0.306 33.099 3856 x.ma101107_rel.z11a_bee.pk1

0.004430 8.462 0.034 0.136 32.22 3650 x.ma110401_rel.z11b_bee.pk1

— relative to first result

1.08 0.98 1 1 0.99 0.97 x.ma100339_rel.z11a_bee.pk1

1.06 0.98 1 1 1.01 0.94 x.ma100433_rel.z11a_bee.pk1

0.83 1.04 1 1 0.08 1.04 x.ma100524_rel.z11a_bee.pk1

0.61 0.23 1 1 0.36 0.98 x.ma100617_rel.z11a_bee.pk1

0.61 0.24 inf inf 0.37 0.97 x.ma101107_rel.z11a_bee.pk1

0.64 0.23 inf inf 0.36 0.92 x.ma110401_rel.z11b_bee.pk1

sb.met.update-index.range100.pk1.dop1

— absolute

cpu/o cs/o r/o rKB/o wKB/o o/s dbms

0.015003 34.740 0 0 110.865 2268 x.ma100244_rel.z11a_bee.pk1

0.016460 39.791 0 0 123.805 1715 x.ma100339_rel.z11a_bee.pk1

0.016282 39.626 0 0 124.773 1734 x.ma100433_rel.z11a_bee.pk1

0.010618 55.358 0 0 111.238 2277 x.ma100524_rel.z11a_bee.pk1

0.007834 12.873 0 0 53.445 1997 x.ma100617_rel.z11a_bee.pk1

0.006468 14.170 0.857 3.426 50.978 2453 x.ma101107_rel.z11a_bee.pk1

0.006733 14.250 0.859 3.434 51.687 2337 x.ma110401_rel.z11b_bee.pk1

— relative to first result

1.10 1.15 1 1 1.12 0.76 x.ma100339_rel.z11a_bee.pk1

1.09 1.14 1 1 1.13 0.76 x.ma100433_rel.z11a_bee.pk1

0.71 1.59 1 1 1.00 1.00 x.ma100524_rel.z11a_bee.pk1

0.52 0.37 1 1 0.48 0.88 x.ma100617_rel.z11a_bee.pk1

0.43 0.41 inf inf 0.46 1.08 x.ma101107_rel.z11a_bee.pk1

0.45 0.41 inf inf 0.47 1.03 x.ma110401_rel.z11b_bee.pk1

The post Sysbench on a (less) small server: MariaDB and MySQL appeared first on MariaDB.org.

Impact of Fragmented PostgreSQL Infrastructure on Performance, Scalability, and Security

2024-04-11T19:19:24+00:00

A fragmented PostgreSQL infrastructure can significantly impact several critical aspects of database management, including performance, scalability, high availability, reliability, and data security. Fragmentation in this context can refer to both data fragmentation (data spread across a database inefficiently) and infrastructure fragmentation (inconsistent configuration or deployment of database components). Understanding the repercussions can help in structuring more cohesive and robust database systems.

1. Performance

Impact: Fragmented data can lead to inefficient use of storage and slow query performance. When data is not contiguous, more disk I/O is required to retrieve the same amount of data, which slows down read and write operations. On an infrastructure level, inconsistent configurations across database nodes (in a cluster environment) can lead to uneven load distribution and inefficient resource utilization.

Mitigation:

Regular maintenance routines like VACUUM and REINDEX can help in managing data fragmentation.
Ensuring consistent configuration across servers using configuration management tools or templates.

2. Scalability

Impact: Fragmented infrastructure can hinder scalability due to the complexities of adding new nodes or resources that need to integrate with an inconsistent environment. Scaling out (adding more nodes) or scaling up (adding resources to existing nodes) can be problematic if each part of the infrastructure does not adhere to a common standard or practice.

Mitigation:

Implement standardized deployment processes and use automation tools to ensure that new nodes or resources are added seamlessly.
Design data partitioning strategies to manage large datasets effectively and reduce bottlenecks.

3. High Availability

Impact: A fragmented approach to high availability, where different nodes or clusters have varied failover mechanisms or replication strategies, can lead to increased downtime and data loss during failures. Discrepancies in replication setups or failover protocols can cause delays in recovery or failovers that do not operate as expected.

Mitigation:

Use a consistent, well-documented approach to replication and failover across all database nodes.
Regularly test failover procedures to ensure that they work correctly under various failure scenarios.

4. Reliability

Impact: Inconsistent configurations and patch levels across database components can lead to unpredictable behavior and system crashes, reducing the overall reliability of the system. Fragmented maintenance and backup strategies can also lead to data inconsistencies and restoration issues.

Mitigation:

Standardize on software versions and patch processes across all components.
Implement a unified backup strategy that ensures all parts of the database are backed up in sync.

5. Data Security

Impact: A fragmented security setup, where different parts of the database follow different security protocols, can create vulnerabilities. Inconsistent application of security updates, configurations, and access controls can lead to breaches and data leaks.

Mitigation:

Enforce uniform security policies across all database servers and components.
Regular audits and updates of security configurations and practices to ensure compliance and protection.

Conclusion

Fragmentation in PostgreSQL infrastructure can lead to serious challenges across several critical aspects of database management. Addressing these issues requires a strategic approach focusing on standardization, automation, and regular maintenance. By mitigating fragmentation, organizations can enhance their database systems’ efficiency, reliability, and security, ensuring that they are robust and scalable enough to meet current and future demands.

How to Troubleshoot Index Fragmentation in PostgreSQL?

Database SRE

Implementing High Availability in PostgreSQL: A Step-By-Step Guide to Setting Up Streaming Replication

The post Impact of Fragmented PostgreSQL Infrastructure on Performance, Scalability, and Security appeared first on The WebScale Database Infrastructure Operations Experts in PostgreSQL, MySQL, MariaDB and ClickHouse.

The post Impact of Fragmented PostgreSQL Infrastructure on Performance, Scalability, and Security appeared first on MariaDB.org.

Evaluating the Impact of Missing Statistics on PostgreSQL Query Performance

2024-04-11T14:21:16+00:00

Comparing PostgreSQL performance with and without up-to-date statistics can significantly illustrate the importance of statistics for database performance, particularly in query planning and execution efficiency. PostgreSQL uses statistics gathered by the ANALYZE command to choose the most efficient query execution plan. Missing or outdated statistics can lead to poor decisions, such as choosing a sequential scan over an index scan or misestimating the number of rows a query operation might return.

Impact of Up-to-Date Statistics on Performance

Query Optimization: PostgreSQL’s query planner uses statistics to estimate the cost of different query plans and choose the most efficient one. Accurate statistics lead to better cost estimates and, consequently, more efficient query execution plans.
Index Utilization: With accurate statistics, the planner can more effectively determine when to use indexes. This can drastically reduce the time required to execute queries by avoiding full table scans.
Join Orders: For queries involving joins, up-to-date statistics help the planner determine the most efficient order to join tables. Incorrect join orders can significantly increase query execution time.

Performance with Missing or Outdated Statistics

Inefficient Query Plans: Without accurate statistics, the planner may make suboptimal choices, such as unnecessarily using full table scans over index scans, leading to slower query responses.
Resource Misallocation: The planner might allocate more resources than necessary for a query operation, affecting the overall system performance and leading to resource contention.
Increased Planning Time: In some cases, the lack of statistics may cause the planner to spend more time evaluating different execution plans, which can slightly increase the planning time for queries.

Testing the Impact

To empirically compare PostgreSQL performance with and without up-to-date statistics, you can conduct a controlled test:

Setup a Test Environment: Ensure you have a representative dataset and query workload.
Baseline Performance Measurement: Execute your typical query workload and measure performance metrics such as execution time, CPU utilization, and I/O operations with up-to-date statistics.
- Run ANALYZE on your database to ensure statistics are current.
Invalidate Statistics:
- Temporarily disable the autovacuum daemon to prevent it from updating statistics.
- Use ALTER TABLE table_name ALTER COLUMN column_name SET STATISTICS -1; on several tables and columns to invalidate statistics. Be cautious with this step, as it affects query planning.
Performance Measurement with Missing Statistics: Execute the same query workload and measure the performance metrics without up-to-date statistics.
Analyze Results: Compare the performance metrics from both tests to identify differences. Look for increased execution times, higher resource consumption, and any changes in query plans.
Restore Environment: Re-enable autovacuum and run ANALYZE on the entire database or affected tables to restore statistics.

Conclusion

While conducting such a test, you’ll likely observe a degradation in query performance when PostgreSQL operates with missing or outdated statistics. This experiment reinforces the importance of maintaining accurate statistics for optimal database performance. Regularly monitoring statistics health and configuring autovacuum and analyze settings appropriately are key practices for sustaining high performance in PostgreSQL databases.

Preventing Broken Foreign Keys in PostgreSQL: Causes and Solutions

How to define and capture Baselines in PostgreSQL Performance Troubleshooting?

Optimizing Query Performance in PostgreSQL 16 with the Advanced auto_explain Extension

The post Evaluating the Impact of Missing Statistics on PostgreSQL Query Performance appeared first on The WebScale Database Infrastructure Operations Experts in PostgreSQL, MySQL, MariaDB and ClickHouse.

The post Evaluating the Impact of Missing Statistics on PostgreSQL Query Performance appeared first on MariaDB.org.

Optimizing Query Performance: Troubleshooting and Resolving Outdated Statistics in PostgreSQL

2024-04-11T13:52:02+00:00

Outdated statistics in PostgreSQL can lead to suboptimal query plans, affecting the performance of your database queries. PostgreSQL uses statistics gathered by the ANALYZE command (either manually or automatically by the autovacuum daemon) to make informed decisions about query plans. Here’s how to troubleshoot and resolve issues related to outdated statistics:

Identifying Outdated Statistics

Query Planning Inefficiency: Noticeable slowdowns in query performance, especially for queries that used to run efficiently, might indicate outdated statistics.
Check Last Analyze Time: Use the pg_stat_all_tables view to check the last time tables were analyzed. If the last_analyze timestamp is significantly old, the statistics may be outdated.
```
SELECT relname, last_analyze
FROM pg_stat_all_tables
WHERE schemaname = 'public';
```

Troubleshooting and Resolving Issues

1. Manually Run ANALYZE

If you identify tables with outdated statistics, manually run the ANALYZE command to update the statistics:

ANALYZE verbose tablename;

Use ANALYZE verbose to analyze all tables and to receive detailed output. Omit tablename to analyze the entire database.

2. Configure Autovacuum and Autoanalyze

Ensure that the autovacuum daemon is properly configured to automatically analyze tables:

Check Autovacuum Settings: PostgreSQL’s autovacuum daemon should be enabled and properly configured to automatically analyze tables. Check postgresql.conf for settings like autovacuum, autovacuum_analyze_threshold, and autovacuum_analyze_scale_factor.
```
autovacuum = on
autovacuum_analyze_threshold = 50
autovacuum_analyze_scale_factor = 0.1
```
Adjust Autovacuum Parameters: If autovacuum doesn’t run as expected, consider adjusting its parameters. For example, reducing autovacuum_analyze_scale_factor can make autovacuum run analyze more frequently.

3. Monitor Autovacuum Logs

Enable Logging: Ensure that logging for autovacuum activities is enabled to monitor its operations. You can adjust the log_autovacuum_min_duration setting in postgresql.conf to log autovacuum and autoanalyze operations taking longer than a specified duration.
```
log_autovacuum_min_duration = '0s'  -- Logs all autovacuum activities
```
Review the Logs: Regularly check the PostgreSQL logs to ensure autovacuum and analyze processes are completing successfully.

4. Use pg_stat_statements to Identify Problematic Queries

Enable pg_stat_statements: This extension provides insights into execution statistics of all SQL statements executed by the server, helping identify queries that may suffer from outdated statistics.
```
CREATE EXTENSION IF NOT EXISTS pg_stat_statements;
```
Analyze Output: Review the output of pg_stat_statements to identify queries with poor performance, which might benefit from updated statistics.

Best Practices for Maintaining Up-to-date Statistics

Regular Maintenance: Schedule regular maintenance windows to manually analyze tables if autovacuum settings do not meet your specific workload requirements.
Tune Autovacuum Parameters: Adjust autovacuum settings based on your database size, workload, and performance observations. This may involve tweaking thresholds and scale factors or increasing worker processes.
Monitor Database Logs: Keep an eye on database logs for autovacuum activity and potential warnings related to statistics.

Maintaining up-to-date statistics is crucial for optimal query planning and overall database performance. Regular monitoring and appropriate configuration of PostgreSQL’s analyze functions can help ensure that your database statistics remain current, preventing performance degradation due to outdated statistics.

Optimizing PostgreSQL: A Guide to Troubleshooting Long-Running Queries and Wait Events

Implementing the Materialized Path Model in PostgreSQL: A Step-by-Step Guide

Mastering Row Locks in PostgreSQL: Ensuring Data Integrity and Performance

The post Optimizing Query Performance: Troubleshooting and Resolving Outdated Statistics in PostgreSQL appeared first on The WebScale Database Infrastructure Operations Experts in PostgreSQL, MySQL, MariaDB and ClickHouse.

The post Optimizing Query Performance: Troubleshooting and Resolving Outdated Statistics in PostgreSQL appeared first on MariaDB.org.

How to Replicate and Rename a Database in MariaDB

2024-04-11T13:14:12+00:00

MySQL/MariaDB replication filter is an important feature when we need to replicate only certain databases or tables. Having this configuration option change dynamically is really convenient, but in this article, we’ll note that some replication filters are not dynamic, and you should be aware of that.The use case here is to replicate one database from […]

The post How to Replicate and Rename a Database in MariaDB appeared first on MariaDB.org.

17 Years of Insecure MySQL Client !

2024-04-10T14:42:00+00:00

Yes, this is a catchy title, but it is true, and it got you reading this post :-). Another title could have been “Please load this mysql-dump: what could go wrong ?”. As you guessed, loading a dump is not a risk-free operation. In this post, I explain how the insecure MySQL client makes this operation risky and how to protect against it.

And if you think this post is not

The post 17 Years of Insecure MySQL Client ! appeared first on MariaDB.org.

Ask Me Anything About MySQL 5.7 to 8.0 Post EOL

2024-04-09T16:02:36+00:00

We met with Vinicius Grippa, a Senior Support Engineer at Percona. He is also active in the open source community and was recognized as a MySQL Rock Star in 2023.In the previous interview with Vinicius, we discussed the upcoming End of Life (EOL) for MySQL 5.7. Now that MySQL 5.7 has reached EOL, MySQL 8 […]

The post Ask Me Anything About MySQL 5.7 to 8.0 Post EOL appeared first on MariaDB.org.

Why Top Spanish Business School Esade Migrated from MySQL to MariaDB Enterprise Server

2024-04-09T16:00:27+00:00

Esade Ramon Llull University is a top business school located in Barcelona, Spain. They are committed to nurturing the next generation of leaders worldwide and welcoming people from various business realities, geographies, and cultures. In 2021, their student community included over 125 nationalities. The Esade faculty aims to give their students the best possible opportunities by adapting to…

Source

The post Why Top Spanish Business School Esade Migrated from MySQL to MariaDB Enterprise Server appeared first on MariaDB.org.

MariaDB Joins Forces with Google Cloud to Enhance Support Operations on Google Distributed Cloud

2024-04-09T12:00:30+00:00

In an era where the cloud is omnipresent, yet the needs for data sovereignty, security, and localized performance are paramount, Google Distributed Cloud (GDC) stands out as a beacon of innovation. Tailored for applications deployed at the edge or on-premises, Google Distributed Cloud addresses the specialized demands of today’s diverse technology landscapes. At the heart of this evolution lies a…

Source

The post MariaDB Joins Forces with Google Cloud to Enhance Support Operations on Google Distributed Cloud appeared first on MariaDB.org.

MariaDB's parallel replication to catch up

2024-04-09T08:53:32+00:00

Due to an application error, our replication stopped for 5 days (over Easter). After the problem was solved, the replication was supposed to catch up, which turned out to be very slow. All the usual tricks (innodb_flush_log_at_trx_commit, sync_binlog, etc.) had already been exhausted. So we tried our hand at parallel replication of the MariaDB server.

Parallel replication is deactivated by default:

Parallel replication is activated by setting the server variables slave_parallel_threads:

SQL > SET GLOBAL slave_parallel_threads = 8;
ERROR 1198 (HY000): This operation cannot be performed as you have a running slave \'\'; run STOP SLAVE \'\' first

However, this must be done when replication is stopped:

SQL > STOP SLAVE;
SQL > SET GLOBAL slave_parallel_threads = 8;
SQL > START SLAVE;

Replication then caught up a little faster. However, as we were impatient, we tried to make it even faster. With the command:

SQL > SHOW SLAVE STATUSG
...
Slave_SQL_Running_State: Waiting for room in worker thread event queue
...

we found the following message. You would also see it using the SHOW PROCESSLIST command:

SQL > SHOW PROCESSLIST;
+--------+-------------+- ... -+-----------+------+-----------------------------------------------+- ... -+
| Id | User | ... | Command | Time | State | ... |
+--------+-------------+- ... -+-----------+------+-----------------------------------------------+- ... -+
... ... ...
| 212496 | system user | ... | Slave_SQL | 16 | Waiting for room in worker thread event queue | ... |
+--------+-------------+- ... -+-----------+------+-----------------------------------------------+- ... -+

According to the documentation, it can help in this case to increase the size of the slave_parallel_max_queued variable slightly (attention: Oom!).

We have played around with the values slave_parallel_threads in the range from 4 to 32 (with 8 vCores) and with slave_parallel_max_queued in the range from 128 kbyte to 32 Mbyte.
Caution: Do not exaggerate: 32 threads x 32 Mbyte = 1 Gbyte RAM (Oom)!

To find out which values are the optimum, you would have to test and measure more extensively. In any case, the replication made up the 5-day backlog after about an hour, towards the end a little more than at the beginning, which was hopefully caused by our configuration adjustments.

Depending on what DML statements are currently running, you can see that all threads can be used or that some threads have to wait for other threads:

Our monitoring also showed us that the CPU load went up, the I/O system got more to do and more rows were modified...

What was also noticeable is that with parallel replication, Foreign Key errors suddenly occurred, a phenomenon that we had not observed before:

FromDual.maas2.prod2 - Warning: InnoDB Foreign Key error detected

Trigger: InnoDB Foreign Key error detected
Trigger status: PROBLEM
Trigger severity: Warning
Trigger URL: https://fromdual.com/innodb-foreign-key-error-detected

Item values: 1

1. InnoDB new Foreign Key error (FromDual.maas2.prod2:FromDual.MySQL.innodb.ForeignKey_new): 1

With the command SHOW ENGINE INNODB STATUSG you can inspect these accordingly or view them in the monitoring:

------------------------
LATEST FOREIGN KEY ERROR
------------------------
2024-04-02 10:36:39 0x7f36088ff640 Transaction:
TRANSACTION 7199599266, ACTIVE 0 sec inserting
mysql tables in use 1, locked 1
6 lock struct(s), heap size 1128, 3 row lock(s), undo log entries 1
MariaDB thread id 228555, OS thread handle 139870048613952, query id 28453893 Write_rows_log_event::write_row(-1) on table `alerts`
insert into alerts (alertid,actionid,eventid,userid,clock,mediatypeid,sendto,subject,message,status,error,esc_step,alerttype,acknowledgeid,parameters) values (203687,4,471733,3,1712044003,1,\'xxx@fromdual.com\',\'Zabbix server - High: Too many processes on Zabbix server\',\'Trigger: Too many processes on Zabbix server
Trigger status: PROBLEM
Trigger severity: High
Trigger URL:

Item values: 309

1. Number of processes (Zabbix server:proc.num[]): 309\',3,\'\',1,0,null,\'{}\')
Foreign key constraint fails for table `zabbix`.`alerts`:
,
CONSTRAINT `c_alerts_2` FOREIGN KEY (`eventid`) REFERENCES `events` (`eventid`) ON DELETE CASCADE in parent table, in index alerts_3 tuple:
DATA TUPLE: 2 fields;
...

But in parent table `zabbix`.`events`, in index PRIMARY,
the closest match we can find is record:
PHYSICAL RECORD: n_fields 12; compact format; info bits 0
...

Literature/Sources

Parallel Replication

Taxonomy upgrade extras: replicationmariadbparallelmulti-threaded

The post MariaDB's parallel replication to catch up appeared first on MariaDB.org.

Shinguz: MariaDB's parallel replication to catch up