{"type":"video","version":"1.0","html":"<iframe src=\"https://www.loom.com/embed/e3fac8ddb842438cb38beaf07f8a43a4\" frameborder=\"0\" width=\"1920\" height=\"1440\" webkitallowfullscreen mozallowfullscreen allowfullscreen></iframe>","height":1440,"width":1920,"provider_name":"Loom","provider_url":"https://www.loom.com","thumbnail_height":1440,"thumbnail_width":1920,"thumbnail_url":"https://cdn.loom.com/sessions/thumbnails/e3fac8ddb842438cb38beaf07f8a43a4-9e0609f1fe8f65c7.gif","duration":183.915,"title":"Run a DCF Collector with GitHub API","description":"This Loom demonstrates how to use DCF to pull data from a public API into a local lakehouse with a quick init-and-run flow. It first creates an empty project via uvx from dcf core dcf init, with the catalog type set to local. This generates project.yaml and a collectors directory that defines a GitHub API collector for repository commits. Running uv run dcf commits then loads the data into a local DuckDB lakehouse as Parquet files under a warehouse directory, writing 100 rows. Finally, it shows how to query the loaded data with dcfquery using SQL, for example selecting from github.dcf commits with a limit of 10."}