Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: upstream-downstream entity linking #36111

Open
wants to merge 13 commits into
base: master
Choose a base branch
from

Conversation

navinkarkera
Copy link
Contributor

@navinkarkera navinkarkera commented Jan 15, 2025

Description

Adds tasks, signal handlers and api's for adding, updating and deleting upstream->downstream links in database.

Supporting information

Testing instructions

  • Checkout this PR.
  • Run tutor images build openedx-dev && tutor dev launch -I --skip-build in your tutor nightly setup.
  • Verify the new database tables by visiting django adming: http://studio.local.openedx.io:8001/admin/oel_linking/
  • To test new management command run
    • tutor dev exec cms ./manage.py cms recreate_upstream_links --all for creating upstream links for all courses.
    • tutor dev exec cms ./manage.py cms recreate_upstream_links --course <course-id> for creating upstream links for a course.
    • use --force argument to recreate upstream links for already indexed courses.
  • Verify the links by visiting the admin page again: http://studio.local.openedx.io:8001/admin/oel_linking/
  • To test automatic creation of these links for new blocks, create new xblock using Library content or Problem bank component or Copy-pasting any library content in any course. The corresponding links should be created in database.
  • Test update of link versions by updating upstream library block and accepting or rejecting the change in course XBlock.
  • Test duplicating a block with upstream, copying and pasting a block with upstream.
  • Deleting an XBlock should delete its link.
  • Test course name update in links by updating course name in Advanced Settings page.
  • Test course import -> This does not work! Working on fixing it

Deadline

"None" if there's no rush, or provide a specific date or event (and reason) if there is one.

Other information

Include anything else that will help reviewers and consumers understand the change.

  • Does this change depend on other changes elsewhere?
  • Any special concerns or limitations? For example: deprecations, migrations, security, or accessibility.
  • If your database migration can't be rolled back easily.

@openedx-webhooks openedx-webhooks added the open-source-contribution PR author is not from Axim or 2U label Jan 15, 2025
@openedx-webhooks
Copy link

openedx-webhooks commented Jan 15, 2025

Thanks for the pull request, @navinkarkera!

This repository is currently maintained by @openedx/wg-maintenance-edx-platform.

Once you've gone through the following steps feel free to tag them in a comment and let them know that your changes are ready for engineering review.

🔘 Get product approval

If you haven't already, check this list to see if your contribution needs to go through the product review process.

  • If it does, you'll need to submit a product proposal for your contribution, and have it reviewed by the Product Working Group.
    • This process (including the steps you'll need to take) is documented here.
  • If it doesn't, simply proceed with the next step.

🔘 Provide context

To help your reviewers and other members of the community understand the purpose and larger context of your changes, feel free to add as much of the following information to the PR description as you can:

  • Dependencies

    This PR must be merged before / after / at the same time as ...

  • Blockers

    This PR is waiting for OEP-1234 to be accepted.

  • Timeline information

    This PR must be merged by XX date because ...

  • Partner information

    This is for a course on edx.org.

  • Supporting documentation
  • Relevant Open edX discussion forum threads

🔘 Get a green build

If one or more checks are failing, continue working on your changes until this is no longer the case and your build turns green.


Where can I find more information?

If you'd like to get more details on all aspects of the review process for open source pull requests (OSPRs), check out the following resources:

When can I expect my changes to be merged?

Our goal is to get community contributions seen and reviewed as efficiently as possible.

However, the amount of time that it takes to review and merge a PR can vary significantly based on factors such as:

  • The size and impact of the changes that it introduces
  • The need for product review
  • Maintenance status of the parent repository

💡 As a result it may take up to several weeks or months to complete a review and merge your PR.

Copy link
Contributor

@pomegranited pomegranited left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is mostly working :) But when I import a course, the links don't get updated, and I'm not sure why not.

Most of my comments are nits about reducing unneeded log volume, or making logs more informative.

Comment on lines 235 to 238
# update_course_name_in_upstream_links.delay(
# str(previous_course_overview.id),
# updated_course_overview.display_name_with_default
# )
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TODO: remove commented-out code

Create or update upstream->downstream link in database for given xblock.
"""
if not xblock.upstream:
log.info(f"No upstream found for xblock: {xblock.usage_key}")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: this will generate a lot of log messages when people change course names, because most course blocks aren't upstream-linked.

Suggested change
log.info(f"No upstream found for xblock: {xblock.usage_key}")

except ObjectDoesNotExist:
log.exception("Library block not found!")
lib_component = None
authoring_api.update_or_create_entity_link(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If we're returning None when we do nothing, shouldn't this return something truthy?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It is possible that a course is imported which has an xblock that an upstream which does not exist in this instance. We want to still save them and link them when the library block is imported (this is not implemented yet, I think I'll do it as part of a future task).

Comment on lines +213 to +224
@receiver(XBLOCK_CREATED)
@receiver(XBLOCK_UPDATED)
def create_or_update_upstream_downstream_link_handler(**kwargs):
"""
Automatically create or update upstream->downstream link in database.
"""
xblock_info = kwargs.get("xblock_info", None)
if not xblock_info or not isinstance(xblock_info, XBlockData):
log.error("Received null or incorrect data for event")
return

create_or_update_xblock_upstream_link.delay(str(xblock_info.usage_key))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I expected that this handler would take care of creating links when importing a course, but it didn't! I only saw a log message for the top-level course block:

No upstream or upstream_version found for xblock: block-v1:OpenCraft+DemoX+1+type@course+block@course

I had to run the management command to set my imported course links.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be fixed now.

ensure_cms("create_or_update_xblock_upstream_link may only be executed in a CMS context")
xblock = modulestore().get_item(UsageKey.from_string(usage_key))
if not xblock.upstream or not xblock.upstream_version:
TASK_LOGGER.info(f"No upstream or upstream_version found for xblock: {xblock.usage_key}")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ditto nit: this will generate a lot of log messages when people update their blocks -- I think we should remove this line.

Suggested change
TASK_LOGGER.info(f"No upstream or upstream_version found for xblock: {xblock.usage_key}")

try:
lib_component = get_component_from_usage_key(upstream_usage_key)
except ObjectDoesNotExist:
log.exception("Library block not found!")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think we need a log.exception here, because this could happen as part of normal operation, e.g if the lib component gets deleted. But it would be good to know a bit more about what it means:

Suggested change
log.exception("Library block not found!")
log.error(f"Library component not found for {upstream_usage_key} -- removing from upstream link")

try:
course_name = CourseOverview.get_from_id(course_key).display_name_with_default
except CourseOverview.DoesNotExist:
TASK_LOGGER.exception(f'Could not find course: {course_key_str}')
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ditto nit: do we need to log the whole exception? I don't think it will tell us much more than the error message does.
And shouldn't we mark the task as failed?

Suggested change
TASK_LOGGER.exception(f'Could not find course: {course_key_str}')
TASK_LOGGER.error(f'Could not find course: {course_key_str}')
course_status.status = CourseLinksStatusChoices.FAILED
course_status.save()

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same here.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How about marking the course_status as failed if it fails?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice catch! Updated.

@navinkarkera
Copy link
Contributor Author

@pomegranited Thanks for the review! About the course imports, I am working on it (I did mention it in the description but it was probably after you started reviewing it).

@navinkarkera navinkarkera force-pushed the navin/fal-4004/link-table branch from 54565b7 to 60b4f43 Compare January 17, 2025 06:28
Copy link
Contributor Author

@navinkarkera navinkarkera left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Links should be created on course imports now.

except ObjectDoesNotExist:
log.exception("Library block not found!")
lib_component = None
authoring_api.update_or_create_entity_link(
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It is possible that a course is imported which has an xblock that an upstream which does not exist in this instance. We want to still save them and link them when the library block is imported (this is not implemented yet, I think I'll do it as part of a future task).

Comment on lines +213 to +224
@receiver(XBLOCK_CREATED)
@receiver(XBLOCK_UPDATED)
def create_or_update_upstream_downstream_link_handler(**kwargs):
"""
Automatically create or update upstream->downstream link in database.
"""
xblock_info = kwargs.get("xblock_info", None)
if not xblock_info or not isinstance(xblock_info, XBlockData):
log.error("Received null or incorrect data for event")
return

create_or_update_xblock_upstream_link.delay(str(xblock_info.usage_key))
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be fixed now.

try:
course_name = CourseOverview.get_from_id(course_key).display_name_with_default
except CourseOverview.DoesNotExist:
TASK_LOGGER.exception(f'Could not find course: {course_key_str}')
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same here.

Copy link
Contributor

@pomegranited pomegranited left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍 Works great @navinkarkera !

  • I tested this using the PR test instructions, including importing a course with links.
  • I read through the code
  • I checked for accessibility issues by using my keyboard to navigate N/A
  • Includes documentation -- code comments
  • User-facing strings are extracted for translation

store = modulestore()
course_key = CourseKey.from_string(course_key_str)
course_status.status = LearningContextLinksStatusChoices.PROCESSING
course_status.save()
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: do you want to update the updated field when the status changes too?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes! Created an API function in learning to help.

try:
course_name = CourseOverview.get_from_id(course_key).display_name_with_default
except CourseOverview.DoesNotExist:
TASK_LOGGER.exception(f'Could not find course: {course_key_str}')
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How about marking the course_status as failed if it fails?

@navinkarkera navinkarkera force-pushed the navin/fal-4004/link-table branch from 5d937b0 to 53ed09f Compare January 20, 2025 04:57
@navinkarkera navinkarkera requested a review from ormsbee January 20, 2025 05:56
@navinkarkera navinkarkera marked this pull request as ready for review January 20, 2025 05:56
Comment on lines 30 to 33
# Recreate upstream links for all courses.
$ ./manage.py cms recreate_upstream_links --all
# Force recreate links for all courses including completely processed ones.
$ ./manage.py cms recreate_upstream_links --all
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nit: The command is repeated the same but with a different description

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated 👍

@navinkarkera navinkarkera force-pushed the navin/fal-4004/link-table branch from 53ed09f to d3fbc1e Compare January 21, 2025 04:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
open-source-contribution PR author is not from Axim or 2U
Projects
Status: Waiting on Author
Development

Successfully merging this pull request may close these issues.

4 participants