40 releases (20 breaking)
0.49.1 | Feb 7, 2025 |
---|---|
0.48.3 |
|
0.48.2 | Nov 7, 2024 |
0.45.0 | Jul 29, 2024 |
0.30.5 | Mar 31, 2023 |
#1 in #opendal
1,488 downloads per month
Used in datafusion-dft
3.5MB
65K
SLoC
Apache OpenDAL™ object_store integration
object_store_opendal
is an object_store
implementation using opendal
.
This crate can help you to access 30 more storage services with the same object_store API.
Useful Links
Examples
opendal_store_opendal
depends on the opendal
crate. Please make sure to always use the latest versions of both.
latest object_store_opendal
latest opendal
1. using object_store
API to access S3
Add the following dependencies to your Cargo.toml
with correct version:
[dependencies]
object_store = "0.11.0"
object_store_opendal = "xxx" # see the latest version above
opendal = { version = "xxx", features = ["services-s3"] } # see the latest version above
Build OpendalStore
via opendal::Operator
:
use std::sync::Arc;
use bytes::Bytes;
use object_store::path::Path;
use object_store::ObjectStore;
use object_store_opendal::OpendalStore;
use opendal::services::S3;
use opendal::{Builder, Operator};
#[tokio::main]
async fn main() {
let builder = S3::from_map(
vec![
("access_key".to_string(), "my_access_key".to_string()),
("secret_key".to_string(), "my_secret_key".to_string()),
("endpoint".to_string(), "my_endpoint".to_string()),
("region".to_string(), "my_region".to_string()),
]
.into_iter()
.collect(),
).unwrap();
// Create a new operator
let operator = Operator::new(builder).unwrap().finish();
// Create a new object store
let object_store = Arc::new(OpendalStore::new(operator));
let path = Path::from("data/nested/test.txt");
let bytes = Bytes::from_static(b"hello, world! I am nested.");
object_store.put(&path, bytes.clone().into()).await.unwrap();
let content = object_store
.get(&path)
.await
.unwrap()
.bytes()
.await
.unwrap();
assert_eq!(content, bytes);
}
2. querying data in a S3 bucket using DataFusion
Add the following dependencies to your Cargo.toml
with correct version:
[dependencies]
object_store = "0.11.0"
object_store_opendal = "xxx" # see the latest version above
opendal = { version = "xxx", features = ["services-s3"] } # see the latest version above
datafusion = "44.0.0"
url = "2.5.2"
Build OpendalStore
via opendal::Operator
and register it to DataFusion
:
use datafusion::error::DataFusionError;
use datafusion::error::Result;
use datafusion::prelude::*;
use opendal::services::S3;
use opendal::Operator;
use std::sync::Arc;
use url::Url;
#[tokio::main]
async fn main() -> Result<()> {
let ctx = SessionContext::new();
// Configure OpenDAL for S3
let region = "my_region";
let bucket_name = "my_bucket";
let builder = S3::default()
.endpoint("my_endpoint")
.bucket(bucket_name)
.region(region)
.access_key_id("my_access_key")
.secret_access_key("my_secret_key");
let op = Operator::new(builder)
.map_err(|err| DataFusionError::External(Box::new(err)))?
.finish();
let store = object_store_opendal::OpendalStore::new(op);
// Register the object store
let path = format!("s3://{bucket_name}");
let s3_url = Url::parse(&path).unwrap();
ctx.register_object_store(&s3_url, Arc::new(store));
// Register CSV file as a table
let path = format!("s3://{bucket_name}/csv/data.csv");
ctx.register_csv("trips", &path, CsvReadOptions::default())
.await?;
// Execute the query
let df = ctx.sql("SELECT * FROM trips LIMIT 10").await?;
// Print the results
df.show().await?;
// Dynamic query using the file path directly
let ctx = ctx.enable_url_table();
let df = ctx
.sql(format!(r#"SELECT * FROM '{}' LIMIT 10"#, &path).as_str())
.await?;
// Print the results
df.show().await?;
Ok(())
}
WASM support
To build with wasm32-unknown-unknown
target, you need to enable the send_wrapper
feature:
cargo build --target wasm32-unknown-unknown --features send_wrapper
Branding
The first and most prominent mentions must use the full form: Apache OpenDAL™ of the name for any individual usage (webpage, handout, slides, etc.) Depending on the context and writing style, you should use the full form of the name sufficiently often to ensure that readers clearly understand the association of both the OpenDAL project and the OpenDAL software product to the ASF as the parent organization.
For more details, see the Apache Product Name Usage Guide.
License and Trademarks
Licensed under the Apache License, Version 2.0: http://www.apache.org/licenses/LICENSE-2.0
Apache OpenDAL, OpenDAL, and Apache are either registered trademarks or trademarks of the Apache Software Foundation.
Dependencies
~12–24MB
~341K SLoC