-
- Downloads
Apply column formatting to CSV exports (#36484)
* Apply column formatting to CSV exports This PR applies the same formatting logic to CSV exports as it does to pulse bodies (e.g. HTML). Formerly, there were two related formatting paths: - `metabase.query-processor.streaming.common/format-value`, which is from a protocol that takes an object and returns a string. It is what was used to export csv data. It applies no actual formatting, only converts objects to strings. - `metabase.pulse.render.body/get-format`, which builds a formatter from an optional time zone, column metadata, and visualization settings. This formatter takes a string and formats it. It was only used for rendering inline artifacts, such as embedded HTML in emails. The first function is insufficient to format row data as it is unaware of the column metadata and viz settings. We need to use that data everywhere data is exported in a uniform way. The solution is to lift `get-format` from `metabase.pulse.render.body` to a more common location (`metabase.pulse.render.common` in this PR step, but needs to be moved out of the pulse code to be a more shared concern) and use it everywhere artifacts are generated. For csv export, this was achieved as follows in `metabase.query-processor.streaming.csv`: ```clojure (defmethod qp.si/streaming-results-writer :csv [_ ^OutputStream os] (let [writer (BufferedWriter. (OutputStreamWriter. os StandardCharsets/UTF_8)) formatters (atom {})] (reify qp.si/StreamingResultsWriter (begin! [_ {{:keys [ordered-cols results_timezone]} :data} viz-settings] (swap! formatters (constantly (zipmap ordered-cols (map (fn [col] (p.common/get-format results_timezone col viz-settings)) ordered-cols)))) (csv/write-csv writer [(map (some-fn :display_name :name) ordered-cols)]) (.flush writer)) (write-row! [_ row _row-num cols {:keys [output-order]}] (let [[ordered-row ordered-cols] (if output-order (let [row-v (into [] row) cols-v (into [] cols)] [(for [i output-order] (row-v i)) (for [i output-order] (cols-v i))]) [row cols])] (csv/write-csv writer [(map (fn [c r] (let [formatter (@formatters c)] (formatter (common/format-value r)))) ordered-cols ordered-row)]) (.flush writer))) (finish! [_ _] ;; TODO -- not sure we need to flush both (.flush writer) (.flush os) (.close writer))))) ``` The formatters for each column are build in the `begin!` function and then looked up in each `write-row!`. The existing `format-value` is used to produce a string then passed into our looked up column formatter. Note that the new unit tests simulate a pulse and grab the provided temp files as attachments and analyzes those for correctness. This should work in a CI environment so long as the test process has permission to both write attachments to the temp directory and read those attachments back out. Also note that these tests can be slow (but not terribly so). Primary changes: - `metabase.email.messages` - fix spelling - `metabase.pulse.render.body` - move `get-format` out of this ns - `metabase.pulse.render.common` - move `get-format` into this ns - `metabase.query-processor.streaming.csv` - new logic to apply pulse renderer formatting to csv exports - `metabase.pulse.pulse-integration-test` - adding unit tests One TODO before a final commit of this PR is to move the `get-format` logic out of a pulse ns into something more general. Ultimately, it would be nice if this was a common capability used by both BE and FE. * Updated formatter state The rows in qp.si/StreamingResultsWriter are ordered, so there should be no need to do a lookup by col. We should just be able to map the ordered-formatters (using the same order as the cols) with the ordered row data. * Updating tests in `metabase.query-processor.streaming-test` to reflect new csv formatting. * Updated fomatter for tests and consistency This updates several tests (metabase.query-processor.streaming.csv-test) to reflect the new and correct behavior of using a common csv formatter as well as update the formatting code to correctly handle relational types and datetimes. * require fmt * Updating metabase.pulse.render.body-test to reflect `:type/DateTime` formatting. * Updating sqlite test * Updating report-timezone-test to reflect new CSV rendering * Fixing e2e test
Showing
- e2e/test/scenarios/sharing/downloads/reproductions/10803-timestamp-formatting.cy.spec.js 1 addition, 1 deletion...loads/reproductions/10803-timestamp-formatting.cy.spec.js
- src/metabase/email/messages.clj 3 additions, 3 deletionssrc/metabase/email/messages.clj
- src/metabase/pulse/render/body.clj 1 addition, 17 deletionssrc/metabase/pulse/render/body.clj
- src/metabase/pulse/render/common.clj 21 additions, 1 deletionsrc/metabase/pulse/render/common.clj
- src/metabase/pulse/render/datetime.clj 5 additions, 2 deletionssrc/metabase/pulse/render/datetime.clj
- src/metabase/query_processor/streaming/csv.clj 13 additions, 4 deletionssrc/metabase/query_processor/streaming/csv.clj
- test/metabase/pulse/pulse_integration_test.clj 103 additions, 8 deletionstest/metabase/pulse/pulse_integration_test.clj
- test/metabase/pulse/render/body_test.clj 19 additions, 19 deletionstest/metabase/pulse/render/body_test.clj
- test/metabase/pulse/render/common_test.clj 13 additions, 0 deletionstest/metabase/pulse/render/common_test.clj
- test/metabase/pulse/render/datetime_test.clj 5 additions, 1 deletiontest/metabase/pulse/render/datetime_test.clj
- test/metabase/query_processor/streaming/csv_test.clj 18 additions, 16 deletionstest/metabase/query_processor/streaming/csv_test.clj
- test/metabase/query_processor/streaming_test.clj 42 additions, 9 deletionstest/metabase/query_processor/streaming_test.clj
Loading
Please register or sign in to comment