mirror of
https://github.com/LukeHagar/pypistats.dev.git
synced 2025-12-09 12:47:48 +00:00
Switch to bigquery-public-data datset from the-psf (#39)
This commit is contained in:
@@ -297,11 +297,11 @@ def get_query(date):
|
|||||||
details.python AS python_version,
|
details.python AS python_version,
|
||||||
details.system.name AS system
|
details.system.name AS system
|
||||||
FROM
|
FROM
|
||||||
`the-psf.pypi.file_downloads`
|
`bigquery-public-data.pypi.file_downloads`
|
||||||
WHERE
|
WHERE
|
||||||
DATE(timestamp) = '{date}'
|
DATE(timestamp) = '{date}'
|
||||||
AND
|
AND
|
||||||
(REGEXP_CONTAINS(details.python,r'^[0-9]\.[0-9]+.{{0,}}$') OR
|
(REGEXP_CONTAINS(details.python,r'^[0-9]\.[0-9]+.{{0,}}$') OR
|
||||||
details.python IS NULL)
|
details.python IS NULL)
|
||||||
)
|
)
|
||||||
SELECT
|
SELECT
|
||||||
|
|||||||
@@ -8,7 +8,7 @@
|
|||||||
Index in lieu of having to execute queries against raw download records in Google BigQuery.</p>
|
Index in lieu of having to execute queries against raw download records in Google BigQuery.</p>
|
||||||
<h3>Data</h3>
|
<h3>Data</h3>
|
||||||
<p>Download stats are sourced from the Python Software Foundation's publicly available
|
<p>Download stats are sourced from the Python Software Foundation's publicly available
|
||||||
<a href="https://bigquery.cloud.google.com/table/the-psf:pypi.downloads">download stats</a>
|
<a href="https://bigquery.cloud.google.com/table/bigquery-public-data:pypi.downloads">download stats</a>
|
||||||
on Google BigQuery. All aggregate download stats ignore known PyPI mirrors (such as
|
on Google BigQuery. All aggregate download stats ignore known PyPI mirrors (such as
|
||||||
<a href="{{ url_for('general.package_page', package='bandersnatch') }}">bandersnatch</a>) unless noted
|
<a href="{{ url_for('general.package_page', package='bandersnatch') }}">bandersnatch</a>) unless noted
|
||||||
otherwise.</p>
|
otherwise.</p>
|
||||||
|
|||||||
@@ -27,7 +27,7 @@
|
|||||||
</p>
|
</p>
|
||||||
<p>
|
<p>
|
||||||
You are much better off extracting the data directly from the Google
|
You are much better off extracting the data directly from the Google
|
||||||
BigQuery <a href="https://bigquery.cloud.google.com/table/the-psf:pypi.downloads">pypi downloads tables</a>. You
|
BigQuery <a href="https://bigquery.cloud.google.com/table/bigquery-public-data:pypi.downloads">pypi downloads tables</a>. You
|
||||||
can query up to 1TB of data FREE every month before having to pay. The volume of data queried for this website
|
can query up to 1TB of data FREE every month before having to pay. The volume of data queried for this website
|
||||||
falls well under that limit (each month of data is less than 100 GB queried) and you will have your data
|
falls well under that limit (each month of data is less than 100 GB queried) and you will have your data
|
||||||
in a relatively short amount of time. <a
|
in a relatively short amount of time. <a
|
||||||
|
|||||||
@@ -8,7 +8,7 @@
|
|||||||
</h3>
|
</h3>
|
||||||
<p>
|
<p>
|
||||||
PyPI provides download records as a publicly available dataset on Google's BigQuery. You can access the data
|
PyPI provides download records as a publicly available dataset on Google's BigQuery. You can access the data
|
||||||
with a Google Cloud account <a href="https://bigquery.cloud.google.com/table/the-psf:pypi.downloads">here</a>.
|
with a Google Cloud account <a href="https://bigquery.cloud.google.com/table/bigquery-public-data:pypi.downloads">here</a>.
|
||||||
</p>
|
</p>
|
||||||
<h3>
|
<h3>
|
||||||
When is the website data updated?
|
When is the website data updated?
|
||||||
|
|||||||
Reference in New Issue
Block a user