Data Science Roles
C3 AI provides the following predefined roles with permissions to perform data science tasks:
- C3.JupyterLab
- C3.PowerDataScientist
- C3.DataScientist
- C3.Spark
Additionally, data science roles relate to the following data disclosure roles:
- C3.DisclosureExempt
- C3.DisclosureRequestor
- C3.DisclosureReviewer
- C3.DataDisclosureAdmin
C3.JupyterLab
The typical persona for a user with the C3.JupyterLab role is someone who uses JupyterLab for data analysis, visualization, development, prototyping, and scripting.
C3.JupyterLab permissions
The C3.JupyterLab role has the following capabilities:
- Access to JupyterLab-related Types:
  - Access to file operations and the file system for content management.
  - Access to packages and development mode for packages.
  - Configure and manage UI settings.
  - Access to all actions on machine learning pipelines.
  - Manage Jupyter resource profiles, runtime information, Conda libraries, and package credentials.
  - Manage code samples.
  - Manage C3.JupyterLab APIs.
  - Check and manage status of JupyterHub.
  - Test Jupyter notebooks.
  - Manage notebook cells.
  - Manage Jupyter content.
  - Handle errors and results for syncing Jupyter content.
C3.PowerDataScientist
The typical persona for a user with the C3.PowerDataScientist role is an advanced data scientist who performs data modeling, machine learning, and possibly development tasks. They likely lead large projects and optimize models.
C3.PowerDataScientist permissions
The C3.PowerDataScientist role combines the C3.DataScientist and C3.DisclosureExempt roles and inherits the capabilities of both.
C3.PowerDataScientist inheritance
The C3.PowerDataScientist role has the nested roles C3.DataScientist and C3.DisclosureExempt. Users with the C3.PowerDataScientist role inherit all permissions from both nested roles.
For C3.DisclosureExempt role capabilities, see the data disclosure roles section in this topic.
C3.DataScientist
The typical persona for a user with the C3.DataScientist role is someone who conducts experiments, builds predictive models, and interprets results to support business decisions. They work with various data sources, JupyterLab and Spark tools, machine learning algorithms, and data visualizations.
C3.DataScientist permissions
The C3.DataScientist role combines the C3.JupyterLab, C3.DisclosureRequestor, and C3.Spark roles and inherits the capabilities of all three.
C3.DataScientist inheritance
The C3.DataScientist role has the nested roles C3.JupyterLab, C3.DisclosureRequestor, and C3.Spark. Users with the C3.DataScientist role inherit all permissions from these three nested roles.
For C3.DisclosureRequestor role capabilities, see the data disclosure roles section in this topic.
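The nesting described for C3.PowerDataScientist and C3.DataScientist can be pictured as a small graph walk. The following Python sketch is illustrative only: the role names and nesting come from this topic, but the NESTED_ROLES dictionary and the effective_roles function are hypothetical names and are not part of any C3 AI API.

```python
# Nested roles as stated in this topic. The dictionary layout is a
# hypothetical illustration, not how the platform stores roles.
NESTED_ROLES = {
    "C3.PowerDataScientist": ["C3.DataScientist", "C3.DisclosureExempt"],
    "C3.DataScientist": ["C3.JupyterLab", "C3.DisclosureRequestor", "C3.Spark"],
}


def effective_roles(role, nested=NESTED_ROLES):
    """Return the role plus every role it nests, directly or transitively."""
    resolved = []
    stack = [role]
    while stack:
        current = stack.pop()
        if current in resolved:
            continue  # already visited; also guards against cycles
        resolved.append(current)
        # Push children in reverse so they are visited in the listed order.
        stack.extend(reversed(nested.get(current, [])))
    return resolved


print(effective_roles("C3.DataScientist"))
print(effective_roles("C3.PowerDataScientist"))
```

Under these assumptions, a user with C3.PowerDataScientist ends up with the permissions of six roles in total: the role itself, C3.DataScientist, the three roles nested under C3.DataScientist, and C3.DisclosureExempt.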
C3.Spark
The typical persona for a user with the C3.Spark role is someone who specializes in distributed computing using Apache Spark. They process large-scale data, perform transformations, aggregations, and analyses using Spark APIs, such as PySpark and Spark SQL.
C3.Spark permissions
The C3.Spark role has the following capabilities:
- Start, stop, and configure Spark instances.
- Create, configure, and manage Spark sessions.
- Plan and run Spark operations.
- Access to Spark data structures and methods:
  - Read, cache, and run SQL queries.
  - Load, transform, group, and aggregate data.
  - Manipulate series objects.
- Access to the following methods:
  - Datetime, float64, categorical, int64, multi-level, timedelta indices, and general index management methods.
  - Loc indexer methods for label-based indexing, and iloc indexer methods for position-based indexing.
  - Datetime-specific, string manipulation, and resampling methods.
- Access to ExMachina for data visualization, ExMachina API, and ExMachina configuration settings.
Data disclosure roles
C3 AI provides the following data disclosure roles for data disclosure management.
C3.DisclosureExempt
The C3.DisclosureExempt role has the following capabilities:
- Have full data access.
- Bypass audit trails or logging mechanisms.
- Perform advanced analytics, data mining, or statistical analysis on any dataset.
- Modify restricted configuration settings.
C3.DisclosureRequestor
The C3.DisclosureRequestor role has the following capabilities:
- Submit data requests for access to datasets or data segments.
- Participate in approval workflow for data disclosure requests.
- Maintain audit trails for data access and disclosure activities.
- Monitor and log data access to adhere to privacy and security protocols.
C3.DisclosureReviewer
The C3.DisclosureReviewer role has the following capabilities:
- Review, approve, or reject pending data disclosure requests.
- Maintain audit trails for data access and disclosure activities.
- Have read-only access to some data for verification purposes.
- Create reports on request statuses, review times, and decision outcomes.
C3.DataDisclosureAdmin
The C3.DataDisclosureAdmin role has the following capabilities:
- Have full data access.
- Manage the data disclosure request process.
- Manage dataset access policies.
- Manage user accounts and access.
- Create reports on request statuses, review times, and decision outcomes.
- Maintain system configurations to ensure security and privacy compliance.
- Approve or reject data disclosure requests as needed.
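The division of duties among the disclosure roles can be summarized as a small request workflow: a requestor submits, and a reviewer or admin approves or rejects. The following Python sketch models that flow under stated assumptions. The ALLOWED table, the DisclosureRequest class, and the submit and review functions are hypothetical names made up for illustration and do not correspond to C3 AI Types or APIs. The admin role is named as it appears in the role list at the top of this topic.

```python
from dataclasses import dataclass, field

# Which role may perform which step, read off the capability lists in this
# topic. The table itself is a hypothetical illustration.
ALLOWED = {
    "submit": {"C3.DisclosureRequestor"},
    "approve": {"C3.DisclosureReviewer", "C3.DataDisclosureAdmin"},
    "reject": {"C3.DisclosureReviewer", "C3.DataDisclosureAdmin"},
}


@dataclass
class DisclosureRequest:
    requestor: str
    dataset: str
    status: str = "pending"
    audit_trail: list = field(default_factory=list)


def submit(user, user_roles, dataset):
    """Create a pending request if the user holds a role that may submit."""
    if not ALLOWED["submit"] & set(user_roles):
        raise PermissionError(f"{user} may not submit disclosure requests")
    request = DisclosureRequest(requestor=user, dataset=dataset)
    request.audit_trail.append(("submit", user))
    return request


def review(request, user, user_roles, action):
    """Approve or reject a pending request and record the decision."""
    if action not in ("approve", "reject"):
        raise ValueError(f"unknown action: {action}")
    if request.status != "pending":
        raise ValueError("request has already been decided")
    if not ALLOWED[action] & set(user_roles):
        raise PermissionError(f"{user} may not {action} disclosure requests")
    request.status = "approved" if action == "approve" else "rejected"
    request.audit_trail.append((action, user))
    return request


request = submit("alice", ["C3.DisclosureRequestor"], "sales_2023")
review(request, "bob", ["C3.DisclosureReviewer"], "approve")
print(request.status, request.audit_trail)
```

The audit_trail list stands in for the audit trails that the requestor and reviewer roles maintain. How the platform actually records and enforces these steps is not described in this topic.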