Contributor meetings
The Trino project organizes regular meetings with contributors to improve collaboration with maintainers and among contributors. This space is used to plan upcoming meetings and to provide meeting minutes after each event.
- Trino Contributor Congregations are in-person meetings, typically run the day after Trino Fest or Trino Summit.
- Trino Contributor Calls are virtual events, typically scheduled for every fourth Thursday of the month.
- Anyone can attend.
- Attendees can request invites from the Trino dev rel team (Manfred Moser (@mosabua), Cole Bowden (@colebow), Brian Olsen (@bitsondatadev)) on Slack.
- Attendees can also just join on the day. Connection details are announced shortly before the event on Slack or on this page.
- Event dates are announced on Slack and LinkedIn, and added to the Trino events calendar.
- Topics can be suggested prior to the event by updating this page or during the meeting.
- 📹 Video recordings are posted on the dedicated YouTube playlist starting with the April 2024 call.
- Meeting notes and other details are captured on this page.
Date to be determined.
Potential topics:
- Discuss proposal to work on CBO improvements - see https://github.com/trinodb/trino/issues/21785, proposed by devozerov
In-person only event in Boston following Trino Fest. Contact Manfred Moser for invite and registration info. 9:00 - 14:00 EST
Topics, notes and action items
- Iceberg aggregation pushdown presented and discussed by osscm
- Incremental refresh on materialized views in Iceberg and beyond osscm
- Open Policy Agent standard policies for relational data domain, related tooling and more vagaerg, dprophet
- Trino Gateway update with roadmap and planning discussion, mosabua
- Trino K8s operator project, osscm and mosabua
📹 Video recording with time stamps on YouTube
Attendees
mosabua, electrum, dain, nineinchnick, brianwmunz, dprophet, findinpath, xkrogen, jkylling, alprusty, marton-bod, vgankidi, osscm, vagaerg, walterddr, lxynov, mgorsk1, Praveen Sadhu, Josh Yeh
Topics, notes and action items
- Jan Was (nineinchnick) as new subproject maintainer for go client, grafana plugin, and helm chart
- Trino Fest and Trino Contributor Congregation in June - next Trino Contributor Call in July
- OpenLineage event listener plugin - PR finalization discussion - https://github.com/trinodb/trino/pull/21265, mgorsk1, alprusty
- Trino to Trino connector update
- Incoming PR soon from https://github.com/sajjoseph/trino/tree/add-trino-to-trino-connector
- Further features to come from others as well
- LDAP group provider plugin
- https://github.com/trinodb/trino/pull/20157
- Author will pull out overloaded feature of mapping groups and roles, dain will then review towards merge
- Discussion around realtime dashboards
- Memory and Arrow connector discussion
- Discussion around embedding query engine (LocalQueryRunner test class)
- Pushdown and query passthrough
- https://github.com/trinodb/trino/blob/master/core/trino-spi/src/main/java/io/trino/spi/connector/ConnectorMetadata.java#L1064
- Varies across connectors, see for example https://trino.io/docs/current/connector/postgresql.html#pushdown
- https://trino.io/docs/current/connector/postgresql.html#query-varchar-table
- Iceberg split rate limiting
- https://github.com/trinodb/trino/pull/18214 dain or findepi to look
- Client option for redirect
- https://github.com/trinodb/trino/pull/21027 electrum to follow up
- Agg pushdown for Iceberg
- https://github.com/trinodb/trino/pull/15832, electrum or findepi to look
- Add retry and jmx metrics while generating Thrift's delegation token [osscm]
- https://github.com/trinodb/trino/pull/21000 electrum to look/merge
- Support for storage table management of materialized views
- https://github.com/trinodb/trino/issues/21797
- agreed to proceed with series of PRs, starting with language, then SPI, then connector
- osscm to figure out next steps on their team, martint to help with initial PR
- Vectorized instruction usage and Java 22-related update (virtual threads)
Attendees
martint, mosabua, sajjoseph, wendigo, nineinchnick, brianwmunz, bitsondatadev, xkrogen, virajjasani, stoty, kmurra
Topics, notes and action items
📹 Video recording with time stamps on YouTube
- Status of upgrade to required Java 22
- Trino works with Java 22
- Trino 444 uses Java 22 in docker container
- Requirement for Java 22 in general to follow in 446, 447, or 448
- Java 23 also used in testing
- No issues reported so far
- Discuss Phoenix connector usage
- A security issue caused the Trino project to contemplate removal
- martint, mosabua and bitsondatadev discuss options for deprecating the connector, reducing its impact, or removing it
- wendigo explains current and past approaches and issues, looking for help
- stoty and virajjasani explain options and offer to help
- see https://github.com/trinodb/trino/pull/20739 and others
- mosabua to follow up on how to proceed with wendigo and martint
- we are looking for people to help and people who use the connector
- reasonable for connector to require phoenix 5.2.0 and explicitly declare and manage dependencies for newer hadoop, phoenix and hbase
- Incremental refresh materialized views (#18673, #20959)
- mosabua explains that work is ongoing but nothing is there to report
- Lateral column alias support
- kmurra explains proposal and discusses with martint
- various complications such as shadowing aliases, usage in aggregation and window functions, and more
- kmurra will file issue with research on how it works on other engines and more
- kmurra to reach out to SQL spec workgroup
- martint will help with input and eventually review of PR for planner and parser
- Trino REST API improvement
- sajjoseph explains nextURI data in HTTP header values for usage with blue/green deployment and more behind load balancer or Trino Gateway
- he will file issue to discuss more and probably send a PR
- same for some performance improvements
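For context on why `nextUri` handling matters behind a load balancer or Trino Gateway: in the Trino client REST protocol, a client submits a query and then keeps requesting the `nextUri` from each response until a response no longer contains one. The following is a minimal Python sketch of that loop, not the proposal discussed in the call. The names `follow_next_uri` and `fetch` are hypothetical, and `fetch` stands in for an HTTP GET that returns the parsed JSON response.

```python
from typing import Callable, Iterator


def follow_next_uri(first: dict, fetch: Callable[[str], dict]) -> Iterator[list]:
    """Yield result rows, following nextUri until a response has none.

    `first` is the parsed JSON response to the initial query submission.
    `fetch` is a hypothetical callable that performs a GET on the given
    URI and returns the parsed JSON body.
    """
    response = first
    while True:
        # "data" is optional; many responses carry no rows at all.
        yield from response.get("data") or []
        next_uri = response.get("nextUri")
        if next_uri is None:
            # No nextUri means there is nothing more to fetch.
            break
        # Every follow-up request must reach a cluster that knows the query,
        # which is why routing matters for blue/green deployments.
        response = fetch(next_uri)


# Usage with canned responses instead of a real cluster (illustrative URIs).
pages = {
    "http://example/1": {"data": [[1], [2]], "nextUri": "http://example/2"},
    "http://example/2": {"data": [[3]]},
}
rows = list(follow_next_uri({"nextUri": "http://example/1"}, pages.__getitem__))
print(rows)
```

Because every follow-up request targets the URI handed back by the server, any proxy in front of the cluster has to route those requests consistently to the cluster that owns the query.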
9am PST
Attendees
martint, electrum, dain, mosabua, sajjoseph, jkylling, amoghmargoor, marton-bod, vgankidi, osscm, wendigo, monimiller, oneonestar, alprusty, nineinchnick, Praveen2112, brianwmunz, xingyuanlin, yathi, manoj narayanan, kasun indrasiri and others
Topics, notes and action items
- Request for speaker submissions for Trino Fest - call out by mosabua
- Trino Contributor Congregation after Trino Fest in person in Boston - call out to contact mosabua about attendance
- Discuss plans to move to Java 22, explained by mosabua, wendigo, martint, dain:
- https://trino.io/blog/2024/03/13/java-22
- https://github.com/trinodb/trino/issues/20980
- Build and runtime already work with Java 22
- One of the next releases will ship the Docker container with Java 22
- A few releases later we will switch to Java 22 as requirement, looking for feedback and testers
- going to include native code via aircompressor using foreign function support
- Java 22 should reduce locking problems from gzip
- JDBC driver and CLI will continue to stay with Java 8
- various other code base segments will see new language and library feature adoption
- V2 protocol might see adoption of new language and library features
- preview features will only be adopted if we see significant benefits
- we will also adopt Java 23, 24, 25 and so on soon after they are released
- Update on file system lead and related work - update from electrum
- removal of Hadoop/Hive library usage completed in new native file system support, massive project and refactor
- new FileSystemAPI is well designed, simple, clean, tested, no unneeded complexity
- electrum acts as lead
- not all legacy features moved
- looking for feedback from testing
- docs will be updated more by mosabua
- see https://trino.io/docs/current/object-storage.html
- old hdfs code will only be for HDFS, other object storage usage will use new
- currently planning to have all file systems disabled by default and require manual activation, looking for feedback on that behavior
- s3 security mapping and other improvements in progress or based on input
- OpenTelemetry
- proven to be VERY useful
- great TCB episode, https://trino.io/episodes/57
- looking for practical experience and PRs to add in other important places
- Merged separation of IR and AST - discussed by martint
- separation of IR (internal representation) and AST (abstract syntax tree)
- massive internal refactor
- fixed many latent, dormant bugs
- brings numerous performance improvements
- great code simplification and improvement
- separated as of the 442 release, after a series of very large PRs
- more cleanup work still ongoing, opportunity for future changes on IR, brings performance improvement
- avoids lots of duplicate transforms
- also removes internal caching needs
- still a month or two of work ongoing, looking for feedback after that in terms of people working on separate forks and so on
- Discuss OkHTTP related regression around redirect - mosabua, electrum and wendigo
- https://github.com/trinodb/trino/issues/21026
- Issue was brought up in the Trino Gateway sync
- Two separate fixes now available
- electrum and mosabua to follow up and sync and help with review and decision making
- redirect with auth can be considered a security risk, might want to restore old behavior
- might also have to go through clients and update them if possible, needs to potentially be made explicit
- also chat about http client in jdbc driver and other clients
- jetty client is too heavy, jdbc driver should probably use jvm http client
- Iceberg Aggregate Pushdown - mosabua and osscm will lead discussion to establish approach and then drive into more features and other connectors
- Call for reviewers, contributors, and maintainers - from mosabua, contact for guidance
- mosabua to plan next call in a few weeks
- Next call will be recorded and available on the YouTube channel
9am PST
Topics
- Upgrade to Java 21 with Trino 436
- Test improvements initiative and developer guide for testing
- Trino Gateway progress update
- Trino Kubernetes Operator
- Default time precision switch to 6 https://github.com/trinodb/trino/pull/20290
- Parallel data transfer (aka V2 protocol) between clients and coordinator (+workers) in JDBC/CLI/other client interfaces.
- Parquet column encryption support
- #18384 Rewrite partial top-n node to LimitNode or LastNNode
Attendees
martint, electrum, dain, mosabua, sajjoseph, rice668, alok, i-93, colebow, jkylling, amoghmargoor, marton-bod, vgankidi, osscm
Notes and action items
- Update on Java 21 provided, all done, no negative feedback from community
- Test improvements initiative and developer guide for testing are in progress, the guide is a living document for further improvements
- Trino Gateway completed release 5, lots more improvements and progress ongoing, dev sync every second Wednesday, release 6 coming soon
- Trino Kubernetes operator - mosabua to start conversation to move forward similar to Trino Gateway subproject, looking for initial code contributions and then ongoing support and maintenance
- Default time precision change to 6
- PR https://github.com/trinodb/trino/pull/20290
- amoghmargoor will link to github issue
- mosabua to update to roadmap issue and add tasks with martint and others
- feature will need a switch since this will NOT be backwards compatible
- lots of impact on connectors, functions, and so on
- testing needs will be significant
- Client protocol
- no active work going on according to dain
- parallel transfer to clients probably best to use object storage filesystem as proxy, parallel write by trino, parallel read by client
- mostly aimed at python and related workloads
- streaming in parallel from memory is not restartable
- lots of discussion about Arrow/ADBC: seen as not suitable as a protocol for Trino since its scope is more limited than what Trino offers in terms of data types and such; also described as tightly tied to Spark, with very limited interoperability and security issues
- in the long run Trino might end up with Arrow support for import/export or so, only for interoperability due to the problems of Arrow
- Arrow not suitable for native use in Trino
- Separate aspect is trino to trino cluster communication
- jdbc just a current approach
- potential to create separate protocol for trino to trino
- communication in cluster is implementation details, arrow not suitable at all, no benefits
- Parquet column encryption
- in progress with amoghmargoor
- blocked by hadoop dependency
- trino filesystem no longer uses hadoop deps
- amoghmargoor will reimplement in collab with electrum
- mosabua followed up - details at https://github.com/trinodb/trino/pull/20069
- Alluxio caching
- PR for Delta Lake from jkylling is ready for merge
- PR for docs in progress with mosabua
- Iceberg and Hive support coming in quick follow up PRs
- Rubix removal PR is ready as well
- Potential for everything to land together in 439
- mosabua to follow up on coordination
- Iceberg agg pushdown
- osscm working with findepi on multiple PRs
- including work in Iceberg project, and ideas for future PRs
- electrum to help
- Secondary index and aggregate index, Iceberg
- osscm working with Iceberg and Trino community
- rice668 working on another(?) implementation
- mosabua to connect and start thread with electrum and findepi
- Rewrite partial top-n node to LimitNode or LastNNode
- PR from rice668
- Assumption of all files being sorted and having correct ordering might be problematic
- martint to chime in with input and potential ideas, https://github.com/trinodb/trino/pull/18384
- mosabua to help coordinate meeting / sync
Prior events include various calls and the first Trino Contributor Congregation at Trino Summit 2022.