Skip to content

Moderation

Moderation

Classifies if text is potentially harmful across multiple categories.

Endpoint

POST /v1/moderations

Request Body

ParameterTypeRequiredDescription
modelstringYesModel ID (e.g., text-moderation-latest)
inputstring/arrayYesText to moderate (string or array of strings)

Example

Terminal window
curl -X POST https://cryptgpt.co/v1/moderations \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "text-moderation-latest",
"input": "I want to hurt someone."
}'

Response

{
"id": "modr-abc123",
"model": "text-moderation-latest",
"results": [
{
"flagged": true,
"categories": {
"hate": false,
"hate/threatening": false,
"harassment": true,
"self-harm": false,
"sexual": false,
"sexual/minors": false,
"violence": true,
"violence/graphic": false
},
"category_scores": {
"hate": 0.001,
"hate/threatening": 0.000,
"harassment": 0.750,
"self-harm": 0.001,
"sexual": 0.000,
"sexual/minors": 0.000,
"violence": 0.850,
"violence/graphic": 0.001
}
}
]
}

Categories

CategoryDescription
hateContent expressing hate toward a group
hate/threateningHateful content with threats
harassmentHarassing or bullying content
self-harmContent promoting self-harm
sexualSexual content
sexual/minorsSexual content involving minors
violenceViolent content
violence/graphicGraphic violent content

A result is flagged if any category score exceeds 0.5.