A rate limit is a restriction on the number of requests a user or system can send to a server or API within a given time frame. It helps prevent overload, ensures fair resource distribution, and mitigates abuse (e.g., DDoS attacks or spam).
Fixed Window – A set number of requests within a fixed time window (e.g., max 100 requests per minute).
Sliding Window – A dynamic limit based on recent requests.
Token Bucket – Users get a certain number of "tokens" for requests, which regenerate over time.
Leaky Bucket – Requests are placed in a queue and processed at a controlled rate.
An API allows a maximum of 60 requests per minute per user.
A website blocks an IP after 10 failed logins within 5 minutes.
If you need to implement rate limits in web development, various techniques and tools are available, such as Redis, NGINX rate limiting, or middleware in frameworks like Laravel or Express.js.
A spider (also called a web crawler or bot) is an automated program that browses the internet to index web pages. These programs are often used by search engines like Google, Bing, or Yahoo to discover and update content in their search index.
Starting Point: The spider begins with a list of URLs to crawl.
Analysis: It fetches the HTML code of a webpage and analyzes its content, links, and metadata.
Following Links: It follows the links found on the page to discover new pages.
Storage: The collected data is sent to the search engine’s database for indexing.
Repetition: The process is repeated regularly to keep the index up to date.
Search engine optimization (SEO)
Price comparison websites
Web archiving (e.g., Wayback Machine)
Automated content analysis for AI models
Some websites use a robots.txt file to specify which areas can or cannot be crawled by a spider.
A crawler (also known as a web crawler, spider, or bot) is an automated program that browses the internet and analyzes web pages. It follows links from page to page and collects information.
Search Engines (e.g., Google's Googlebot) – Index web pages so they appear in search engine results.
Price Comparison Websites – Scan online stores for the latest prices and products.
SEO Tools – Analyze websites for technical errors or optimization potential.
Data Analysis & Monitoring – Track website content for market research or competitor analysis.
Archiving – Save web pages for future reference (e.g., Internet Archive).
Starts with a list of URLs.
Fetches web pages and stores content (text, metadata, links).
Follows links on the page and repeats the process.
Saves or processes the collected data depending on its purpose.
Many websites use a robots.txt file to control which content crawlers can visit or ignore.
An Internationalized Resource Identifier (IRI) is an extended version of a Uniform Resource Identifier (URI) that supports Unicode characters beyond the ASCII character set. This allows non-Latin scripts (e.g., Chinese, Arabic, Cyrillic) and special characters to be used in web addresses and other identifiers.
A-Z
, 0-9
, -
, .
, _
), IRIs allow characters from the entire Unicode character set.https://de.wikipedia.org/wiki/Überblick
https://de.wikipedia.org/wiki/%C3%9Cberblick
Ü
is encoded as %C3%9C
)IRIs are defined in RFC 3987 and are supported in modern web technologies like HTML5, XML, and RDF.
IRIs make the internet more linguistically inclusive by allowing websites and resources to be referenced using non-Latin characters, improving accessibility worldwide.
A backlink is a link from an external website that points to your own website. It’s like a recommendation or reference: when another website links to yours, it signals to search engines that your content might be relevant and trustworthy.
SEO Ranking Factor:
Backlinks are one of the most critical criteria for search engines like Google to determine a website's relevance and authority. The more high-quality backlinks a site has, the better its chances of ranking higher in search results.
Traffic Source:
Backlinks drive direct traffic to your site when users click on the link.
Reputation and Trust:
Links from well-known and trusted websites (e.g., news outlets or industry leaders) boost your site’s credibility.
DoFollow Backlinks:
These pass on "link juice" (link equity), which positively impacts SEO rankings.
NoFollow Backlinks:
These tell search engines not to follow the link. While they have less impact on rankings, they can still drive traffic to your site.
Create High-Quality Content:
Content that is helpful, interesting, or unique often gets linked by other websites.
Write Guest Posts:
Publish articles on other blogs or websites and include links to your own.
Broken Link Building:
Identify broken links on other websites and suggest replacing them with links to your content.
Networking and Collaborations:
Build partnerships with other website owners to exchange or gain backlinks.
In software development, semantics refers to the meaning or purpose of code or data. It focuses on what a program is supposed to do, as opposed to syntax, which deals with how the code is written.
a = 5
b = 0
print(a / b)
2. HTML Semantics:
<header> instead of <div> for a webpage header.
3. Semantic Models:
In software development, syntax refers to the formal rules that define how code must be written so that it can be correctly interpreted by a compiler or interpreter. These rules dictate the structure, arrangement, and usage of language elements such as keywords, operators, brackets, variables, and more.
Language-Specific Rules
Every programming language has its own syntax. What is valid in one language may cause errors in another.
Example:
Python relies on indentation, while Java uses curly braces.
Python:
if x > 0:
print("Positive Zahl")
Java:
if (x > 0) {
System.out.println("Positive Zahl");
}
Syntax Errors
Syntax errors occur when the code does not follow the language's rules. These errors prevent the program from running.
Example (Syntax error in Python):
print "Hello, World!" # Fehlende Klammern
3. Syntax vs. Semantics
4. Tools for Syntax Checking
Variable Naming: Variable names cannot contain spaces or special characters.
Variablenbenennung: Variablennamen dürfen keine Leerzeichen oder Sonderzeichen enthalten.
my_variable = 10 # korrekt
my-variable = 10 # Syntaxfehler
{ ... }
.
Duplicate Code refers to instances where identical or very similar code appears multiple times in a program. It is considered a bad practice because it can lead to issues with maintainability, readability, and error-proneness.
1. Exact Duplicates: Code that is completely identical. This often happens when developers copy and paste the same code in different locations.
Example:
def calculate_area_circle(radius):
return 3.14 * radius * radius
def calculate_area_sphere(radius):
return 3.14 * radius * radius # Identical code
2. Structural Duplicates: Code that is not exactly the same but has similar structure and functionality, with minor differences such as variable names.
Example:
def calculate_area_circle(radius):
return 3.14 * radius * radius
def calculate_area_square(side):
return side * side # Similar structure
3. Logical Duplicates: Code that performs the same task but is written differently.
Example:
def calculate_area_circle(radius):
return 3.14 * radius ** 2
def calculate_area_circle_alt(radius):
return 3.14 * radius * radius # Same logic, different style
1. Refactoring: Extract similar or identical code into a shared function or method.
Example:
def calculate_area(shape, dimension):
if shape == 'circle':
return 3.14 * dimension * dimension
elif shape == 'square':
return dimension * dimension
2. Modularization: Use functions and classes to reduce repetition.
3. Apply the DRY Principle: "Don't Repeat Yourself" – avoid duplicating information or logic in your code.
4. Use Tools: Tools like SonarQube or CodeClimate can automatically detect duplicate code.
Reducing duplicate code improves code quality, simplifies maintenance, and minimizes the risk of bugs in the software.
"Lines of Code" (LOC) is a software development metric that measures the number of lines written in a program or application. This metric is often used to gauge the size, complexity, and effort required for a project. LOC is applied in several ways:
Code Complexity and Maintainability: A high LOC count can suggest that a project is more complex or harder to maintain. Developers often aim to keep code minimal and efficient, as fewer lines typically mean fewer potential bugs and easier maintenance.
Productivity Measurement: Some organizations use LOC to evaluate developer productivity, though the quality of the code—rather than just quantity—is essential. A high number of lines could also result from inefficient solutions or redundancies.
Project Progress and Estimations: LOC can help in assessing project progress or in making rough estimates of the development effort for future projects.
While LOC is a simple and widely used metric, it has limitations since it doesn’t reflect code efficiency, readability, or quality.
Cyclomatic complexity is a metric used to assess the complexity of a program's code or software module. It measures the number of independent execution paths within a program, based on its control flow structure. Developed by Thomas J. McCabe, this metric helps evaluate a program’s testability, maintainability, and susceptibility to errors.
Cyclomatic complexity V(G)V(G) is calculated using the control flow graph of a program. This graph consists of nodes (representing statements or blocks) and edges (representing control flow paths between blocks). The formula is:
V(G)=E−N+2PV(G) = E - N + 2P
In practice, a simplified calculation is often used by counting the number of branching points (such as If, While, or For loops).
Cyclomatic complexity indicates the minimum number of test cases needed to cover each path in a program once. A higher cyclomatic complexity suggests a more complex and potentially error-prone codebase.
By measuring cyclomatic complexity, developers can identify potential maintenance issues early and target specific parts of the code for simplification and refactoring.