gdal-async
v3.9.2
Published
Bindings to GDAL (Geospatial Data Abstraction Library) with full async support
Downloads
12,705
Maintainers
Readme
node-gdal-async
Read and write raster and vector geospatial datasets straight from Node.js with this native asynchronous GDAL binding. To get started, browse the API Documentation or examples.
When in doubt on how to use a method, check also the unit tests.
I am looking for a company willing to cover the costs of renting a remote Apple M1 host in order to provide support for the new Apple Macbooks. If paying by the hour, you should expect about 1 or 2 days of uptime per month. Contact me via a message on mmomtchev/node-gdal-async#30.
Fork notes
This project is a fork of https://github.com/contra/node-gdal-next which is a fork of https://github.com/naturalatlas/node-gdal.
It adds a number of features:
- Support for asynchronous IO (since 3.0)
- Full native TypeScript support with autocompletion in IDEs that support it (since 3.1)
- Built-in networking support with native support for cloud services (since 3.2)
- scijs/ndarray integration with zero-copy I/O (since 3.3)
- Read-only support for the new Multidimensional Raster Data Model of GDAL 3.1 (since 3.3)
- Library versions of GDAL's command-line utilities
gdalinfo
,gdal_translate
,ogr2ogr
andgdalwarp
(since 3.4) - Node.js Streams API for raster datasets (since 3.4)
- An alternative
gdal_calc.py
implementation supporting JS functions (since 3.4) ExprTk.js
integration for truly multi-threaded async computing (since 3.5)- VRT pixel functions support - both with JS functions and ExprTk expressions (since 3.5)
- Support for curve geometries (since 3.1)
- Progress callbacks (since 3.2)
- Built-in support for HDF5, NetCDF, GRIB, WMS, WMTS, WCS and FlatGeobuf (since 3.3)
- Numerous bugfixes including a number of memory leaks
Support for worker_threads
is planned but it is not a priority project
Long term support of this project
The birth of this project is related to a huge extortion in the geography community linked to a sexual harassment affair, linked to a penis-size problem, and covered up with corruption in the French Judiciary. It is maintained as a free service to the geography community so that it can remain as a constant remainder to OSGeo - who chose to back the extortion. I have been working on it while living on social welfare. You can safely use this framework in your projects, be assured that it will be maintained very well and for many years to come. Even if lately, this project has had a number of issues declared in a way as to make evident that my users support the extortion, I remain fully committed to maintaining it.
It's companion project on the client-side is rlayers
.
Installation
Pre-built binaries are provided for most recent Linux distributions, Windows 64 bit and OS X 10.15:
npm install gdal-async
Linking with your own version of GDAL
By default all dependencies are the latest versions and bundled out of the box, but if you would like to link against a custom GDAL library you will have to rebuild it when installing using the following flags:
# --shared_gdal allows linking to the OS-provided libgdal, requires libgdal-dev
# (debian: sudo apt-get install libgdal-dev)
$ npm install gdal-async --build-from-source --shared_gdal
If you have multiple versions of GDAL or your library is not in its standard location, you can use the usual CXXFLAGS
and LDFLAGS
variables:
$ CXXFLAGS="-I/some/path/include" LDFLAGS="-L/some/path/lib" npm install gdal-async --build-from-source --shared_gdal
Sample usage
Synchronous
Raster
const gdal = require("gdal-async")
const dataset = gdal.open("sample.tif")
console.log("number of bands: " + dataset.bands.count())
console.log("width: " + dataset.rasterSize.x)
console.log("height: " + dataset.rasterSize.y)
console.log("geotransform: " + dataset.geoTransform)
console.log("srs: " + (dataset.srs ? dataset.srs.toWKT() : 'null'))
Vector
const gdal = require("gdal-async")
const dataset = gdal.open("sample.shp")
const layer = dataset.layers.get(0)
console.log("number of features: " + layer.features.count())
console.log("fields: " + layer.fields.getNames())
console.log("extent: " + JSON.stringify(layer.extent))
console.log("srs: " + (layer.srs ? layer.srs.toWKT() : 'null'))
Asynchronous
Mixing of synchronous and asynchronous operations is supported.
Mixing of synchronous and asynchronous operations
Simultaneous operations on distinct dataset objects are always safe and can run it parallel.
Simultaneous operations on the same dataset object are safe too but they won't run in parallel. This is a limitation of GDAL. The only way to have multiple parallel operations on the same file is to use multiple dataset objects. Keep in mind that Node.js/libuv won't be able to detect which async contexts are waiting on each other, so if you launch 16 simultaneous operations on 4 different datasets, there is always a chance that libuv will pick 4 operations on the same dataset to run - which will take all 4 slots on the thread pool. It is recommended to either increase UV_THREADPOOL_SIZE
or to make sure that every dataset has exactly one operation running at any given time. Take a look at ASYNCIO.md
which explains this in detail.
Also be particularly careful when mixing synchronous and asynchronous operations in server code. If a GDAL operation is running in the background for any given Dataset, all synchronous operations on that same Dataset on the main thread will block the event loop until the background operation is finished. This includes synchronous getters and setters that might otherwise be instantaneous.. It is recommended to retrieve all values such as raster size or no data value or spatial reference before starting any I/O operations or use the new asynchronous getters available in 3.3.2 and later.
Does not support worker_threads
yet
The HDF5 driver is not thread safe on Windows
With callbacks
If the last argument of an xxxAsync
function is a callback,
it will be called on completion with standard (e,r) semantics
In this case the function will return a resolved Promise
const gdal = require('gdal-async');
gdal.openAsync('sample.tif', (e, dataset) => {
if (e) {
console.error(e);
return;
}
dataset.bands.get(1).pixels.readAsync(0, 0, dataset.rasterSize.x,
dataset.rasterSize.y, (e, data) => {
if (e) {
console.error(e);
return;
}
console.log(data);
});
});
With promises
If there is no callback, the function will return a Promise
const gdal = require('gdal-async')
gdal.openAsync('sample.tif')
.then((dataset) => dataset.bands.getAsync(1))
.then((band) => band.pixels.readAsync(0, 0, band.ds.rasterSize.x, band.ds.rasterSize.y))
.then((data) => console.log(data))
.catch((e) => console.error(e))
With async/await
const gdal = require('gdal-async');
try {
const ds = await gdal.openAsync('sample.tif');
const band = await ds.bands.getAsync(1);
const rasterSize = await ds.rasterSizeAsync; // starting with 3.3.2
const data = await band.pixels.readAsync(0, 0, rasterSize.x, rasterSize.y);
} catch (e => console.error(e));
TypeScript (starting from 3.1)
TypeScript support is available beginning with [email protected]
import * as gdal from 'gdal-async'
const ds1: gdal.Dataset = gdal.open('sample.tif')
const ds2: Promise<gdal.Dataset> = gdal.openAsync('sample.tif')
In TypeScript gdal
is a collection of named exports - as this is the preferred way of importing a native module in TypeScript and it does not impose the use of the esModuleInterop
flag.
MJS (Node.js ES6 modules)
When importing from a so-called Michael Jackson Script environment (Node.js native ES6 modules), the correct import syntax is:
import gdal from 'gdal-async'
In MJS gdal
is a default export - as this allows for maximum compatibility with existing CJS applications.
Built-in networking (starting from 3.2)
Built-in networking uses an embedded version of libcurl
. It supports zlib
compression through Node.js' own zlib
support. It does not support brotli
or zstd
. Node.js includes brotli
, but as of Node.js 16 it still does not export these symbols for use by add-ons (yes, go bug them - ask them for c-ares too).
SSL on Linux uses OpenSSL through Node.js' own support. It uses the curl trusted root CA store by default and another store can be provided through the CURL_CA_BUNDLE
environment variable or GDAL config option. SSL on Windows and OSX uses the OS-provided mechanisms - Schannel and SecureTransport respectively - and thus the trusted root CA store will be the one provided by the OS.
Breaking changes relative to node-gdal / node-gdal-next
If you are coming from
node-gdal-next
- Starting with 3.2
gdal.Geometry.fromGeoJson
now throws anError
on error instead of returning a null geometry
- Starting with 3.3
- Drop support for linking against a shared GDAL 1.x library
- Drop support for Node.js 10.x and 15.x
- Linux uses lazy-binding meaning that parts of the binary won't be loaded in memory until the first function call
gdal.DatasetBands.get{Async}()
now throws anError
if an invalid band is requested instead of returning a null objectgdal.DatasetLayers.get{Async}()
now throws anError
if an invalid layer is requested instead of returning a null objectgdal.FeatureDefnFields.get{Async}()
now throws anError
if an invalid field definition is requested instead of returning a null objectgdal.FeatureFields.get{Async}()
now throws anError
if an invalid field is requested instead of returning a null object- it can still return
null
if the field exists in the definition but it is not set in the feature
- it can still return
gdal.GDALDrivers.get()
now throws anError
if an invalid driver is requested instead of returning a null objectgdal.GeometryCollectionChildren.get()
now throws anError
if an invalid sub-geometry is requested instead of returning a null objectgdal.LayerFeatures.get{Async}()
now throws anError
if an invalid feature is requested instead of returning a null objectgdal.LayerFields.get{Async}()
now throws anError
if an invalid field is requested instead of returning a null objectgdal.LineStringPoints.get()
now throws anError
if an invalid point is requested instead of returning a null objectgdal.PolygonRings.get()
now throws anError
if an invalid ring is requested instead of returning a null object- All
gdal.Geometry
methods that return agdal.Geometry
throw anError
on error instead of returning a null object
- Starting with 3.4
- GDAL >= 2.1 is required
- All platforms use lazy-binding meaning that parts of the binary won't be loaded in memory until the first function call
gdal.RasterBand.noDataValue
now returnsNaN
if the original value wasNaN
instead of null; when setting it tonull
it correctly clears theNoDataValue
instead of setting it toNaN
; one should still exercise care when usingNaN
as aNoDataValue
- since even if the binary representation is defined by IEEE 754, different formats, compilers or architectures may exhibit different behavior- TypeScript bindings now internally use
strict: true
mode with many getters and functions receiving an additional explicitnull
return type, consider usingstrictNullChecks: false
in yourtsconfig.json
if you need to suppress these new errors in existing code
- Starting with 3.2
If you are coming from
node-gdal
, in addition to all of the above- With PROJ 6+, the order of coordinates for EPSG geographic coordinate reference systems is latitude first,
longitude second. If you don't want to make large code changes, you can replace code like
gdal.SpatialReference.fromEPSG(4326)
withgdal.SpatialReference.fromProj4('+init=epsg:4326')
- With PROJ 6+, the order of coordinates for EPSG geographic coordinate reference systems is latitude first,
longitude second. If you don't want to make large code changes, you can replace code like
With ndarray
from scijs
A separate plugin ndarray-gdal allows zero-copy I/O, with GDAL-backed interleaving in C++ using SIMD instructions, for all possible strides both for 2D raster data and N-dimensional MDArray
data. The plugin requires at least [email protected]
.
With ExprTk.js
A separate plugin gdal-exprtk adds seamless integration with ExprTk.js allowing for truly multi-threaded and asynchronous computing with raster datasets. The plugin requires at least [email protected]
.
Pruning the source tree
A special NPM script target, npm run prune
allows to delete most of the bundled source packages - these are not needed unless rebuilding from source. It significantly reduces the overall size of this module.
Using in Amazon Linux Lambdas
Amazon Linux was supported starting from [email protected]
. There is no shared GDAL support, only the bundled GDAL version is supported. It is highly recommended that you prune the module before deploying to an Amazon Lambda - when pruned the module is only 16MB zipped. Starting from [email protected]
only Amazon 2022 will be supported.
Using with Electron
Refer to the provided example in examples/electron
. Prepare for some very rough edge on Windows due to #29893. Also https
is not supported when running in Electron.
Bundling with rollup
If you are bundling your application for production deployment, gdal-async
can be bundled with rollup
through rollup-plugin-natives
since rollup-plugin-natives#13
. Here is a an example configuration:
plugins: [
native({
copyTo: 'build/prod/lib',
destDir: './lib'
}),
copy({
targets: [
{
src: 'node_modules/gdal-async/deps/libgdal/gdal/data',
dest: 'build/deps/libgdal/gdal'
},
{
src: 'node_modules/gdal-async/deps/libproj/proj/data',
dest: 'build/deps/libproj/proj'
}
]
})
]
You can check XC-DB for a working project that uses gdal-async
with Express
and is bundled for production with rollup
.
Bundling with Next.js
As gdal-async
is a Node.js native module, it does not work in client side components which run in the client browser. You can however use it in server side components, if you can ensure that this code will never get called on the client side.
Add this to your next.config.js
:
/** @type {import('next').NextConfig} */
export default {
webpack: (config) => {
const externals = ['gdal-async'];
if (config.externals)
config.externals.push(...externals);
else
config.externals = externals;
return config;
}
};
Known issues
- #2 When running multiple parallel async operations per
Dataset
and on multipleDataset
s, thread starvation is possible as explained inASYNCIO.md
- #11 Accessing a closed
Dataset
object results in a warning with a stack trace printed out to stdout - #17 HDF5 on Windows is not thread-safe
- #28
gdal-async
is not compatible with Electron >= 16.0.0 on Windows
Bundled Drivers
When using the bundled GDAL version, the following drivers will be available:
AAIGrid
, ACE2
, ADRG
, AIG
, AVCBin
, AVCE00
, AirSAR
, BLX
, BMP
, BT
, carto
, CEOS
, COASP
, COG
, COSAR
, CPG
, CSV
, CTG
, CTable2
, DGN
, DIMAP
, DIPEx
, DOQ1
, DOQ2
, DTED
, DXF
, ECRGTOC
, EDIGEO
, EHdr
, EIR
, ELAS
, ENVI
, ERS
, ESAT
, ESRI Shapefile
, MapInfo File
, MBTiles
, FAST
, FIT
, FlatGeobuf
, GFF
, GML
, GPSBabel
, GPSTrackMaker
, GPX
, GRASSASCIIGrid
, GRIB
, GS7BG
, GSAG
, GSBG
, GSC
, GTX
, GTiff
, GenBin
, GeoJSON
, GeoRSS
, Geoconcept
, GPKG
, HDF5
, HF2
, HFA
, ILWIS
, IRIS
, ISIS2
, ISIS3
, Idrisi
, JAXAPALSAR
, JDEM
, JPEG
, JSONFG
, KMLSUPEROVERLAY
, KML
, KRO
, L1B
, LAN
, LCP
, LOSLAS
, Leveller
, MAP
, MEM
, Memory
, MFF2
, MFF
, MITAB
, MVT
, NDF
, NetCDF
, NGSGEOID
, NITF
, NTv2
, NWT_GRC
, NWT_GRD
, OGR_PDS
, OGR_SDTS
, OGR_VRT
, OpenJPEG
, OSM
, OpenFileGDB
, PAux
, PCIDSK
, PDS
, PGDUMP
, PNG
, PMTiles
, PNM
, RMF
, ROI_PAC
, RPFTOC
, RS2
, RST
, R
, S57
, S100
, S102
, SAGA
, SAR_CEOS
, SDTS
, SGI
, SNODAS
, SQLite
, SRP
, SRTMHGT
, SVG
, SXF
, TIL
, TSX
, Terragen
, UK .NTF
, USGSDEM
, VICAR
, VRT
, vsiaz
, vsicurl
, vsigs
, vsigzip
, vsimem
, vsioss
, vsis3
, WAsP
, WCS
, WMS
, WMTS
, XPM
, XYZ
, ZMap
When compiling against a system-installed shared GDAL, all drivers and projections supported by it, should also be supported by gdal-async
.
Contributors
This binding was originally the product of a collaboration between Natural Atlas and Mapbox. Its contributors are Brandon Reavis, Brian Reavis, Dane Springmeyer, Zac McCormick, and others.
node-gdal-next is maintained by @contra.
The async bindings, the curve geometries, the TypeScript support, the built-in networking and the multidimensional raster model are by @mmomtchev who is the current maintainer.
Before submitting pull requests, please update the tests and make sure they all pass.
$ npm test # test against bundled gdal
$ npm run test:shared # test against most major versions
# test against shared gdal on given Linux version and Node.js version
$ npm run container dev {ubuntu|centos|fedora|debian|archlinux|amazon}:{version} 14|16|18|20|lts shared
License
Copyright © 2015–2017 Natural Atlas, Inc. & Contributors
Copyright © 2020-2021 Momtchil Momtchev, @mmomtchev & Contributors
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at: http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
Unit-tested platforms with pre-built binaries
Release binaries with pre-built bundled GDAL are tested against the full matrix of:
- Node.js versions: 14.x, 16.x, 17.x, 18.x, 20.x
- OS: Ubuntu 20.04, Ubuntu 22.04, CentOS 9 Stream, Fedora 36, Fedora 37, Debian 10 buster, Debian 11 bullseye, Arch Linux current, Amazon Linux, Windows Server 2019, macOS Catalina 10.15 (EOL), macOS Monterey 12
On Ubuntu 16.04, Amazon Linux, Windows and macOS only the bundled GDAL version is officially supported. On all other platforms both the bundled and the system-installed versions are supported.
Node.js 17 on ArchLinux is not ABI compatible with the NodeSource binaries and requires rebuilding from source.
On Linux, the binaries require glibc 2.23 (Ubuntu 16), except for Node.js 18 which requires glibc 2.28 (Ubuntu 20 / Debian 10).
Development versions are unit tested for the following targets: