Testing with feature flags
To run a specific test with a feature flag enabled you can use the QA::Runtime::Feature
class to
enable and disable feature flags (via the API).
Note that administrator authorization is required to change feature flags. QA::Runtime::Feature
automatically authenticates as an administrator as long as you provide an appropriate access
token via GITLAB_QA_ADMIN_ACCESS_TOKEN
(recommended), or provide GITLAB_ADMIN_USERNAME
and GITLAB_ADMIN_PASSWORD
.
feature_flag
RSpec tag
Please be sure to include the feature_flag
tag so that the test can be skipped on the appropriate environments.
Optional metadata:
name
- Format:
feature_flag: { name: 'feature_flag_name' }
- Used only for informational purposes at this time. It should be included to help quickly determine what feature flag is under test.
scope
- Format:
feature_flag: { name: 'feature_flag_name', scope: :project }
- When
scope
is set to:global
, the test will be skipped on all live .com environments. This is to avoid issues with feature flag changes affecting other tests or users on that environment. - When
scope
is set to any other value (such as:project
,:group
or:user
), or if noscope
is specified, the test will only be skipped on canary, production, and pre-production. This is due to the fact that administrator access is not available there.
WARNING: You are strongly advised to first try and enable feature flags only for a group, project, user, or feature group.
- If a global feature flag must be used, it is strongly recommended to apply
scope: :global
to thefeature_flag
metadata. This is, however, left up to the SET's discretion to determine the level of risk.- For example, a test uses a global feature flag that only affects a small area of the application and is also needed to check for critical issues on live environments.
In such a scenario, it would be riskier to skip running the test. For cases like this,
scope
can be left out of the metadata so that it can still run in live environments with administrator access, such as staging.
- For example, a test uses a global feature flag that only affects a small area of the application and is also needed to check for critical issues on live environments.
In such a scenario, it would be riskier to skip running the test. For cases like this,
Note on requires_admin
: This tag should still be applied if there are other actions within the test that require administrator access that are unrelated to updating a
feature flag (like creating a user via the API).
The code below would enable a feature flag named :feature_flag_name
for the project
created by the test:
RSpec.describe "with feature flag enabled", feature_flag: {
name: 'feature_flag_name',
scope: :project
} do
let(:project) { Resource::Project.fabricate_via_api! }
around do |example|
Runtime::Feature.enable(:feature_flag_name, project: project)
example.run
Runtime::Feature.disable(:feature_flag_name, project: project)
end
it "feature flag test" do
# Execute the test with the feature flag enabled.
# It will only affect the project created in this test.
end
end
Note that the enable
and disable
methods first set the flag and then check that the updated
value is returned by the API.
Similarly, you can enable a feature for a group, user, or feature group:
group = Resource::Group.fabricate_via_api!
Runtime::Feature.enable(:feature_flag_name, group: group)
user = Resource::User.fabricate_via_api!
Runtime::Feature.enable(:feature_flag_name, user: user)
feature_group = "a_feature_group"
Runtime::Feature.enable(:feature_flag_name, feature_group: feature_group)
If no scope is provided, the feature flag is set instance-wide:
# This will affect all users!
Runtime::Feature.enable(:feature_flag_name)
Working with selectors
A new feature often replaces a vue
component or a haml
file with a new one.
In most cases, the new file or component is accessible only with a feature flag.
This approach becomes problematic when tests must pass both with, and without,
the feature flag enabled. To ensure tests pass in both scenarios:
- Create another selector inside the new component or file.
- Give it the same name as the old one.
Selectors are connected to a specific frontend file in the page object,
and checked for availability inside our qa:selectors
test. If the mentioned selector
is missing inside that frontend file, the test fails. To ensure selectors are
available when a feature flag is enabled or disabled, add the new selector to the
page object, leaving the old selector in place.
The test uses the correct selector and still detects missing selectors.
If a new feature changes an existing frontend file that already has a selector, you can add a new selector with the same name. However, only one of the selectors displays on the page. You should:
- Disable the other with the feature flag.
- Add a comment in the frontend file to delete the old selector from the frontend file and from the page object file when the feature flag is removed.
Example before
# This is the link to the old file
view 'app/views/devise/passwords/edit.html.haml' do
# The new selector should have the same name
element :password_field
...
end
Example after
view 'app/views/devise/passwords/edit.html.haml' do
element :password_field
...
end
# Now it can verify the selector is available
view 'app/views/devise/passwords/new_edit_behind_ff.html.haml' do
# The selector has the same name
element :password_field
end
Working with resource classes
If a resource class must behave differently when a feature flag is active, toggle a variable with the name of the feature flag inside the class. This variable and condition ensure all actions are handled appropriately.
You can set this variable inside the fabricate_via_api
call. For a consistent approach:
- Use an
activated
check, not a deactivated one. - Add the word
activated
to the end of a variable's name. - Inside the
initialize
method, set the variable's default value.
For example:
def initialize
name_of_the_feature_flag_activated = false
...
end
Cleanup
After the feature flag is removed, clean up the resource class and delete the variable. All methods should use the condition procedures of the now-default state.
Managing flakiness due to caching
All application settings, and all feature flags, are cached inside GitLab for one minute. All caching is disabled during testing, except on static environments.
When a test changes a feature flag, it can cause flaky behavior if elements are visible only with an active feature flag. To circumvent this behavior, add a wait for elements behind a feature flag.
Running a scenario with a feature flag enabled
It's also possible to run an entire scenario with a feature flag enabled, without having to edit existing tests or write new ones.
Please see the QA README for details.
Confirming that end-to-end tests pass with a feature flag enabled
End-to-end tests should pass with a feature flag enabled before it is enabled on Staging or on GitLab.com. Tests that need to be updated should be identified as part of quad-planning. The relevant counterpart Software Engineer in Test is responsible for updating the tests or assisting another engineer to do so. However, if a change does not go through quad-planning and a required test update is not made, test failures could block deployment.
Automatic test execution when a feature flag definition changes
There are two ways to confirm that end-to-end tests pass:
- If a merge request adds or edits a feature flag definition file,
two
e2e:package-and-test
jobs (ee:instance-parallel
andee:instance-parallel-ff-inverse
) are included automatically in the merge request pipeline. One job runs the application with default feature flag state and another sets it to inverse value. The jobs execute the same suite of tests to confirm that they pass with the feature flag either enabled or disabled. - In some cases, if end-to-end test jobs didn't trigger automatically, or if it has run the tests with the default feature flag values (which might not be desired), you can create a Draft MR that enables the feature flag to ensure that all E2E tests pass with the feature flag enabled and disabled.
Troubleshooting end-to-end test failures with feature flag enabled
If enabling the feature flag results in E2E test failures, you can browse the artifacts in the failed pipeline to see screenshots of the failed tests. After which, you can either:
- Identify tests that need to be updated and contact the relevant counterpart Software Engineer in Test responsible for updating the tests or assisting another engineer to do so. However, if a change does not go through quad-planning and a required test update is not made, test failures could block deployment.
- Run the failed tests locally with the feature flag enabled. This option requires considerable amount of setup, but you'll be able to see what the browser is doing as it's running the failed tests, which can help debug the problem faster. You can also refer to the Troubleshooting Guide for E2E tests for support for common blockers.
Test execution during feature development
If an end-to-end test enables a feature flag, the end-to-end test suite can be used to test changes in a merge request
by running the e2e:package-and-test
job in the merge request pipeline. If the feature flag and relevant changes have already been merged, you can confirm that the tests
pass on the default branch. The end-to-end tests run on the default branch every two hours, and the results are posted to a
Test Session Report, which is available in the testcase-sessions project.
If the relevant tests do not enable the feature flag themselves, you can check if the tests will need to be updated by opening a draft merge request that enables the flag by default via a feature flag definition file. That will automatically execute the end-to-end test suite. The merge request can be closed once the tests pass. If you need assistance to update the tests, please contact the relevant stable counterpart in the Quality department, or any Software Engineer in Test if there is no stable counterpart for your group.